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Abstract 


Over the past two decades, fair resource allocation problems have received considerable attention in 
a variety of application areas. However, little progress has been made in the design of distributed algo¬ 
rithms with convergence guarantees for general and commonly used a-fair allocations. In this paper, 
we study weighted a-fair packing problems, that is, the problems of maximizing the objective functions 
(i) I (1 — a) when a > 0, a ^ 1 and (ii) Wj Ina;^ when a = 1, over linear constraints 

Ax < b, X > 0, where Wj are positive weights and A and b are non-negative. We consider the dis¬ 
tributed computation model that was used for packing linear programs and network utility maximization 
problems. Under this model, we provide a distributed algorithm for general a that converges to an 
£—approximate solution in time (number of distributed iterations) that has an inverse polynomial depen¬ 
dence on the approximation parameter e and poly-logarithmic dependence on the problem size. This 
is the first distributed algorithm for weighted a—fair packing with poly-logarithmic convergence in the 
input size. The algorithm uses simple local update rules and is stateless (namely, it allows asynchronous 
updates, is self-stabilizing, and allows incremental and local adjustments). We also obtain a number of 
structural results that characterize a—fair allocations as the value of a is varied. These results deepen 
our understanding of fairness guarantees in a—fair packing allocations, and also provide insight into the 
behavior of a—fair allocations in the asymptotic cases a —0, a —?► 1, and a —oo. 


‘Supported in part by the NSF grant CNS-10-54856 and a Qualcomm Innovation Fellowship, 
t Supported in part by the NSF grants CCF-1349602 and CCF-1421161. 

^Supported in part by the NSF grant CNS-10-54856 and the People Programme (Marie Curie Actions) of the European Union’s 
Seventh Framework Programme (FP7/2007-2013) under REA grant agreement n°[PIIE-GA-2013-629740].ll. 



1 Introduction 


Over the past two decades,/a/r resource allocation problems have received considerable attention in many 
application areas, including Internet congestion control |[32l . rate control in software defined networks |[35l . 
scheduling in wireless networks |[46l . multi-resource allocation and scheduling in datacenters 11121120112111^ . 
and a variety of applications in operations research, economics, and game theory 1111 112311 . In most of these 
applications, positive linear (packing) constraints arise as a natural model of the allowable allocations. 

In this paper, we focus on the problem of finding an a-fair vector on the set determined by packing 
constraints Ax < 1, x > 0 where all Aij > oQ We refer to this problem as a—fair packing. For a vector 
of positive weights w and a > 0, an allocation vector x* of size n is weighted a-fair, if for any alternative 
feasible vector x: < 0 |[38ll . For a compact and convex feasible region, x* can be equivalently 

defined as a vector that solves the problem of maximizing Pa{x) = Yj xujfa{xj) |[38l . where: 


fa{Xj) 


ln(x 




1—a ’ 


if a = 1 
if a 7 ^ 1 


( 1 ) 


a-fairness provides a trade-off between efficiency (sum of allocated resources) and fairness (minimum al¬ 
located resource) as a function of a: the higher the a, the better the fairness guarantees and the lower 
the efficiency ||4l[TTl[3Tl. Important special cases are proportional fairness (a = 1) and max-min fairness 
(a —)• oo). When a = 0, we have the “unfair” case of linear optimization. 

Distributed algorithms for a—fair packing are of particular interest, as many applications are inherently 
distributed (such as, e.g., network congestion control), while in others parallelization is highly desirable due 
to the large problem size (as in, e.g., resource allocation in datacenters). We adopt the model of distributed 
computation commonly used in the design of packing linear programming (LP) algorithms Il3ir7ll8ll29ll33ll42ll 
and which generalizes the model from network congestion control Il26l . In this model, an agent j controls 
the variable xj and has information about: (i) the j* column of the mxn constraint matrix A, (ii) the weight 
Wj, (iii) upper bounds on the global problem parameters m, n, Wynux, and Amax^ where rumax = maxj Wj, 
and Amax = maxjj Aij, and (iv) in each round, the relative slack of each constraint i in which xj takes part. 

Distributed algorithms for a—fair resource allocations have been most widely studied in the network 
congestion control literature, using a control-theoretic approach Il251l261[32ll381l411l4^ . Such an approach 
yields continuous-time algorithms that converge after “finite” time; however, the convergence time of these 
algorithms as a function of the input size is poorly understood. Some other distributed pseudo-polynomial- 
time approximation algorithms that can address a-fair packing are described in Table [U These algorithms 
all have convergence times that are at least linear in the parameters describing the problem. 

No previous work has given truly fast (poly-log iterations) distributed algorithms for the general case of 
a-fair packing. Only for the unfair a = 0 case (packing LPs), are such algorithms known Il^l7l[^l29ll331l47l . 


Our Results. We provide the first efficient, distributed, and stateless algorithm for weighted a-fair 
packing, namely, for the problem max{pQ,(x) : Ax < 1, x > 0}, where distributed agents update the values 
of Xj’s asynchronously and react only to the current state of the constraints. We assume that all non-zero 
entries Aij of matrix A satisfy Aij > 1. Considering such a normalized form of the problem is without loss 
of generality (see Appendix lAl). 

The approximation provided by the algorithm, to which we refer as the e-approximation, is (i) (1 -|- 
e)-multiplicative for a / 1, and (ii) VFe-additiv^l for a = 1, where W = Yj'^j- The main results 
are summarized in the following theorem, where, to unify the statement of the results, we treat a as a 

'Although in the network congestion control literature the constraint matrix A is commonly assumed to be a 0-1 matrix 1251 
1261132113^14111461 . important applications (such as, e.g., multi-resource allocation in datacenters) are modeled by a more general 
constraint matrix A with arbitrary non-negative elements I12II20I[2T1|24I . 

^Note that W cannot be avoided here, as additive approximation is not invariant to the scaling of the objective. 
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constant that is either equal to 1 or bounded away from 0 and 1, and we also loosen the bound in terms of 
e~^,n, m, = maxj ^ Wj/wk, and ^max- For a more detailed statement, see Theorems 14. 1 
Theorem 1.1. (Main Result) For a given weighted a-fair packing problem max{Y^j Wj faixj) : Ax < 
1, X > 0}, where fa (xj) is given by dJl), there exists a stateless and distributed algorithm ( a-FAiRPS OLVER ) 
that computes an e-approximate solution in 0(e~^ In^{RwnmA^six£~^)) rounds. 

To the best of our knowledge, for any constant approximation parameter e, our algorithm is the first 
distributed algorithm for weighted a-fair packing problems with a poly-logarithmic convergence time. 

The algorithm is stateless according to the definition given by Awerbuch and Khandekar IHITl : it starts 
from any initial state, the agents update the variables xj in a cooperative but uncoordinated manner, re¬ 
acting only to the current state of the constraints that they observe, and without access to a global clock. 
Statelessness implies various desirable properties of a distributed algorithm, such as: asynchronous updates, 
self-stabilization, and incremental and local adjustments ||6l|7l. 

We also obtain the following structural results that characterize a—fair packing allocations as a function 
of the value of a: 

• We derive a lower bound on the minimum coordinate of the a—fair packing allocation as a function of a 
and the problem parameters (Lemma r4.29l) . This bound deepens our understanding of how the fairness (a 
minimum allocated value) changes with a. 

• We prove that for a < a—fair packing can be 0(e)—approximated by any e—approximation 

packing LP solver (Lemma r4.30l) . This result allows us to focus on the a > /g) cases. 

• We show that for |a — 1| = 0(e^/ln^(e“^i?^mnAmax))> a—fair allocation is e—approximated by a 
1—fair allocation returned by our algorithm (Lemmas 14.3ll and l4.32l) . 

• We show that for a > InlRyjnA^ax)/£, the a—fair packing allocation x* and the max-min fair allocation 
z* are e-close to each other: (1 — e)z* < x* < (1 -\- £)z* element-wise. This result is especially 
interesting as (i) max-min fair packing is not a convex problem, but rather a multi-objective problem (see, 
e-g-> II271I44II ) and (ii) the result yields the first convex relaxation of max-min fair allocation problems with 
a 1 ± e gap. 

We now overview some of the main technical details of a-FAiRPSOLVER. In doing so, we point out 
connections to the two main bodies of previous work, from packing LPs fT| and network congestion control 
ll25ll . We also outline the new algorithmic ideas and proofs that were needed to obtain the results. 

The algorithm and KKT conditions. The algorithm maintains primal and dual feasible solutions and 
updates each primal variable Xj whenever a Karush-Kuhn-Tucker (KKT) condition Yhi Vi^ij = Wj is 
not approximately satisfied. In previous work, relevanf updafe rules include: |[25l (for a = 1), where fhe 
updafe of each variable Xj is proportional to the difference Wj — UiAij, and JTl (for a = 0 ), where 

each Xj is updated by a multiplicative factor 1 ± /3, whenever Xj" Yli = xuj is not approximately sat¬ 
isfied. For our fechniques (addressing a general a) such rules do nof suffice and we infroduce fhe following 
modifications: (i) in fhe a < 1 case we use mulfiplicafive updafes by factors (1 -|- /3i) and (1 — /32), where 
/3i / P 2 and (ii) we use additional threshold values 6j to make sure that xj’s do not become too small. These 
thresholds guarantee that we maintain a feasible solution, but they significantly complicate (compared to the 
linear case) the argument that each step makes a significant progress. 

Dual Variables. In a-FAiRPSOLVER, a dual variable pi is an exponential function of the con¬ 
straint’s relative slack: yi{x) = C ■ where C and k are functions of global input parameters 

a, W ma x, n, m, and A m a x. Packing LP algorithms l|3l|7l[8l[T7l|T8l[28l|43l use similar dual variables with 
(7 = 1. Our work requires choosing (7 to be a function of a, tCmax, n,m, A^ax rather than a constant. 
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Paper 

Number of Distributed Iterations,^ 

Statelessness 

Notes 

ca 


Semi-stateless,^ 

Only for a = 1 

ii 


Not stateless 


m 

poly(e“\m,n, Amax) 

Semi-stateless 


[this work] 

0(e“®hA(i?,„mnAi„ax/e)) 

Stateless 



Table 1: Comparison among distributed algorithms for a—fair packing. 


Convergence Argument. The convergence analysis of a-FAiRPSOLVER relies on the appropriately 
chosen concave potential function that is bounded below and above for xj G 1], Vj, and that increases 
with every primal update. The algorithm can also be interpreted as a gradient ascent on a regularized 
objective function (the potential function), using a generalized entropy regularizer (see 111131). A similar 
potential function was used in many works on packing and covering linear programs, such as, e.g., in f7l 
and (implicitly) in 1471 . The Lyapunov function from |[25l is also equivalent to this potential function when 
yi{x) = C ■ \ji_ As in these works, the main idea in the analysis is to show that whenever 

a solution x is not “close” to the optimal one, the potential function increases substantially. However, our 
work requires several new ideas in the convergence proofs, the most notable being stationary rounds. A 
stationary round is roughly a time when the variables Xj do not change much and are close to the optimum. 
Poly-logarithmic convergence time is then obtained by showing that: (i) there is at most a poly-logarithmic 
number of non-stationary rounds where the potential function increases additively and the increase is “large 
enough”, and (ii) in all the remaining non-stationary rounds, the potential function increases multiplicatively. 
Our use of stationary rounds is new, as is the use of Lagrangian duality and all the arguments that follow. 

Relationship to Previous Work. Very little progress has been made in the design of efficient distributed 
algorithms for the general class of a-fair objectives. Classical work on distributed rate control algorithms 
in the networking literature uses a control-theoretic approach to optimize a-fair objectives. While such an 
approach has been extensively studied and applied to various network settings 1125 1I261I321I38114111461 . it has 
never been proven to have polynomial convergence time (and it is unclear whether such a result can be 
established). 

Since a-fair objectives are concave, their optimization over a region determined by linear constraints 
is solvable in polynomial time in a centralized setting through convex programming (see, e.g., mil Sol). 
Distributed gradient methods for network utility maximization problems, such as e.g., ll9l[39l summarized in 
TableSl can be employed to address the problem of a-fair packing. However, the convergence times of these 
algorithms depend on the dual gradient’s Lipschitz constant to produce good approximations. While Il9ll39l 
provide a better dependence on the accuracy e than our work, the dependence on the dual gradient’s Lipschitz 
constant, in general, leads to at least linear convergence time as a function of n, m, and Amax- 

As mentioned before, some special cases have been addressed, particularly for max-min fairness (a 
oo) and for packing LPs (a = 0). Relevant work on max-min fairness includes II101114112211271I301I341I36I . but 
none of these works have poly-logarithmic convergence time. There is a long history of interesting work on 
packing LPs in both centralized and distributed settings, e.g., lfT]l3f7llMT^19ll28ll29ll33ll43ll47l . Only a few of 
these works are stateless, including the packing LP algorithm of Awerbuch and Khandekar Q, flow control 
algorithm of Garg and Young lfT9l . and the algorithm of Awerbuch, Azar, and Khandekar f5T| for the special 
case of load balancing in bipartite graphs. Additionally, the packing LP algorithm of Allen-Zhu and Orecchia 
ll3l is “semi-stateless”; the lacking property to make it stateless is that it requires synchronous updates. The 
a = 1 case of a-fair packing problems is equivalent to the problem of finding an equilibrium allocation 

^The convergence times in 1^115113^ are not stated only in terms of the input parameters, but also in terms of intermediary 
parameters that depend on the problem structure. Stated here are our lowest estimates of the worst-case convergence times. 

"'a distributed algorithm is semi-stateless, if all the updates depend only on the current state of the constraints, the updates are 
performed in a cooperative but non-coordinated manner, and the updates need to be synchronous O. 
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in Eisenberg-Gale markets with Leontief utilities (see ITSl ). Similar to the aforementioned algorithms, the 
algorithm from ifTSll converges in time linear in e~^ but also (at least) linear in the input size (see Table 1). 

In terms of the techniques, closest to our work is the work by Awerbuch and Khandekar Q and we 
now highlight the differences compared to this work. Some preliminaries of the convergence proof follow 
closely those from Q : mainly. Lemmas I4.4114.61 and 14.91 use similar arguments as corresponding lemmas 
in 1^. Some parts of the lemmas lower-bounding the potential increase in a < 1, a = 1, and a > 1 cases 
(Lemmas 14.101 |4~T^ and 14.221) use similar arguments as Q, however, even those parts require additional 
results due to the existence of lower thresholds Sj. 

The similarity ends here, as the main convergence arguments are different than those used in Q. In 
particular, the convergence argument from Q relying on stationary intervals cannot be applied in the setting 
of a—fair objectives. More details about why this argument cannot be applied and where it fails are provided 
in Section IH As already mentioned, we rely on the appropriately chosen dehnition of a stationary round. 
To show that in a stationary round a solution x is e—approximate, we use Lagrangian duality and bound the 
duality gap through an intricate case analysis. We remark that such an argument could not have been used 
in 0, since in the packing LP case there is no guarantee that the solution y is dual-feasible. 

Organization of the Paper, The rest of the paper is organized as follows. Section |2] provides the back¬ 
ground. Section [3] describes the algorithm, and Section |4] provides the convergence analysis and structural 
results. Section [5] concludes the paper. 

2 Preliminaries 

Weighted a-Fair Packing. Consider the following optimization problem with positive linear (packing) 
constraints: (Qa) = max{pQ,(x) = Yl]=i ■ Ax < b,x > 0}, where fa{xj) is given by ([T]), x = 

(xi,..., Xn) is the vector of variables, A is an m x n matrix with non-negative elements, and b = {bi, ...,bm) 
is a vector with strictly positiv^ elements. We refer to {Qa) as the weighted a-fair packing. The following 
dehnition and lemma introduced by Mo and Walrand |[38ll characterize weighted a-fair allocations. In the 
rest of the paper, we will use the terms weighted a-fair and a-fair interchangeably. 

Definition 2.1. |[^ Let w = (rui, ...,Wn) be a vector with positive entries and a > 0. A vector x* = 
(xj,..., X*) is weighted a-fair, if it is feasible and for any other feasible vector x: 

Lemma 2.2. ll^ A vector x* solves {Qa)for functions fa{x*j) if and only if it is weighted a-fair. 

Notice in {Qa) that since bi > 0, Vi, and the partial derivative of the objective with respect to any of the 
variables xj goes to oo as xj —)• 0, the optimal solution must he in the positive orthant. Moreover, since the 
objective is strictly concave and maximized over a convex region, the optimal solution is unique and {Qa) 
satishes strong duality (see, e.g., fT3l l. The same observations are true for the scaled version of the problem 
denoted by {Pa) and introduced in the following subsection. 

Normalized Form. We consider the weighted a-fair packing problem in the normalized form: 

{Pa) = max{pQ,(x) : Ax < l,x > O}, 

where Pa{x) = Yl'j=i'^jfoi{xj), fa is dehned by ([T]), w = {wi,...,Wn) is a vector of positive weights, 
X = (xi,..., Xn) is the vector of variables, A is an m x n matrix with non-negative entries, and 1 is a size-m 
vector of I’s. We let Amax denote the maximum element of the constraint matrix A, and assume that every 
entry Aij of A is non-negative, and moreover, that Ajj > 1 whenever Ajj / 0. The maximum weight is 
denoted by rumax and the minimum weight is denoted by rumin- The sum of the weights is denoted by W and 
the ratio by We remark that considering problem {Qa) in the normalized form {Pa) is without 
loss of generality: any problem {Qa) can be scaled to this form by (i) dividing both sides of each inequality 

^If, for some i, bi = 0, then trivially Xj — 0, for all j such that Aij f 0. 
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i by bi and (ii) working with scaled variables c • Xj, where c = min{l, min{j - 5 ^}. Moreover, such 

scaling preserves the approximation (1X1). 

KKT Conditions and Duality Gap We will denote the Lagrange multipliers for (Pq) as y = {yi, 
and refer to them as “dual variables”. The KKT conditions for (P^) are (see Appendix iBl): 


^AijXj<l, Vi G {1, 
i=i 


Xj > 0 , 

Vi > 0, 


Vi ■ 


E ^ijXj - 1) = 0, 


i=i 

m 

E Vi^ij = 


Vj G ,n} 

Vi G {1, m} 
Vi G {1, m} 

Vj G 


(primal feasibility) 

(dual feasibility) 
(complementary slackness) 

(gradient conditions) 


i=l 


The duality gap for a / 1 is (see Appendix iBl): 


X'} / Q; —1 


Ga{x,y) = E Wj ^ {i 


i=i 


- 1 ) + E Vi - E 

1=1 j=i 


WjX- 


A—Oi 


where P = 


yi^ij 


, while for a = 1 : 


Gi{x,y) = -Yj Wj\n 
i=i 


^jEi=lVi^ij 


Wn 


m 

+ Y Vi-w. 

i=l 


(Kl) 

(K2) 

(K3) 

(K4) 

( 2 ) 

(3) 


Model of Distributed Computation We adopt the same model of distributed computation as 

, described as follows. We assume that for each j G n}, there is an agent controlling the 

variable Xj. Agent j is assumed to have information about the following problem parameters: (i) the 
column of A, (ii) the weight wj, and (iii) (an upper bound on) m, n, rcmax. and Amax- In each round, agent 
j collects the relative slack@ 1 — Ej^=i ^ij^j constraints i for which Aij / 0. 

We remark that this model of distributed computation is a generalization of the model considered in 
network congestion control problems Il26ll where a variable Xj corresponds to the rate of node j, A is a 0-1 
routing matrix, such that A^ = 1 if and only if a node j sends flow over link i, and b is the vector of link 
capacities. Under this model, the knowledge about the relative slack of each constraint corresponds to each 
node collecting (a function of) congestion on each link that it utilizes. Such a model was used in network 
utility maximization problems with a-fair objectives Il25ll and general strongly-concave objectives f9j. 


3 Algorithm 


The pseudocode for the a-FAiRPSOLVER algorithm that is run at each node j is provided in Fig 1. The 
basic intuition is that the algorithm keeps KKT conditions (IKII) and (IK2I) satisfied and works towards (ap¬ 
proximately) satisfying the remaining two KKT conditions (IK3I) and (IK4b to minimize the duality gap. The 
algorithm can run in the distributed setting described in Section |2] In each round, an agent j updates the 
value of Xj based on the relative slack of all the constraints in which j takes part, as long as the KKT con¬ 
dition (IK4I) of agent j is not approximately satisfied. The updates need not be synchronous: we will require 
that all agents make updates at the same speed, but without access to a global clock. 

To allow for self-stabilization and dynamic changes, the algorithm runs forever at all the agents, which 
is a standard requirement for self-stabilizing algorithms (see, e.g., iflhlU . The convergence of the algorithm 
is measured as the number of rounds between the round in which the algorithm starts from some initial 
solution and the round in which it reaches an e—approximate solution, assuming that there are no hard reset 
events or node/constraint insertions/deletions in between. 


“The slack is “relative” because in a non-scaled version of the problem where one could have bi A agent j would need to 
bi — YA ^ ■ 

have information about — - ^. 
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Q!-FAIRPSOLVER(e) 


(Parameters 6 j, C, k, 7, /3i, and /32 are set as described in the text below the algorithm.) 
In each round of the algorithm: 

1: Xj ■(— ma.x{xj, 6 j}, Xj = min{a;j, 1} 

2: Update the dual variables: yi = C ■ g 

3. if < (i-j) then 

4: ' (1 “1“ /^i) 

5: else ^ ^ 

6 : if > (1+7) then 

7: Xj <— maxjxj • (1 — ^ 2 ), Sj} 


Figure 1: Pseudocode of o-FairPSolver algorithm. 


Without loss of generality, we assume that the input parameter e that determines the approximation 
quality satisfies e < min{i, for any a, and e < for a < 1. The parameters Sj, C,k,^, /3i, and 
j 32 are set as follows. For technical reasons (mainly due to reinforcing dominant multiplicative updates of 
the variables Xj), we set the values of the lower thresholds 6j below the actual lower bound of the optimal 
solution that we derive in Lemma 14.29 1 


<^7 = 


Wn 


Wr, 


Ija 


1 


f^-Au 

1 


:) 


1/a 


if 0 < a < 1 
if a > 1 


2-1/a I 

<5i, (5min = miiij 6j. The constant C that multiplies the exponent in the dual 


We denote _ 

variables yi is chosen as C = Because 61 only depends on Wj and on global parameters, we also 

have C = Vj. The parameter k that appears in the exponent of the Uj’s is chosen as k = - In 

Oj £ \ SW^nin ' 


The “absolute error” of (IK4I) 7 is set to e/4. For a > 1, we set /3i = /32 = /?> where the choice of /3 is 
described below. For a < 1, we set Bi = B, 82 = /3^(ln(x^))~^- 

Similar to ||7|, we choose the value of /3 so that if we set ^i = h — round the value of each 

^ ^ changes by a multiplicative factor of at most (1 zb 7/4). Since the maximum increase over 

any xj in each iteration is by a factor 1 + /3, and x is feasible in each round (see Lemma 14^ . we have that 
EUAi: jXj < 1, and therefore, the maximum increase in each yi is by a factor of A similar argument 
holds for the maximum decrease. Hence, we choose jB so that: 


and it suffices fo sef: 


(l + /3)"e'^^ <1 + 7/4 

I 


and (1 - > 1 - 7/4, 


7 


/? = 


_ ) 5(k+1)’ 


1 


7 


SfK + o) ' 

Remark: In fhe a < 1 cases, since (32 = 0^{\ri{\/5rau 
a factor (1 - (7/4) • /3(ln(l/(5min))“^), Vj. 


if a < 1 
if a > 1 


)) fhe maximum decrease in 




is by 


4 Convergence Analysis 

In fhis section, we analyze fhe convergence fime of a-FAiRPSOLVER. We firsf sfafe our main fheorems 
and provide some general resulfs fhaf hold for all a > 0. We show fhaf sfarfing from an arbifrary solution, 
fhe algorifhm reaches a feasible solution wifhin poly-logarifhmic (in fhe inpuf size) number of rounds, and 
mainfains a feasible solufion forever affer. Similar fo l|7l|25l|471, we use a concave pofenfial funefion fhaf, 
for feasible x, is bounded below and above and increases wifh any algorifhm updafe. Then, we analyze fhe 
convergence time separately for fhree cases: a < 1, a = 1, and a > 1. Wifh an appropriafe definition of a 
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stationary round for each of the three cases, we show that in every stationary round, x approximates “well” 
the optimal solution by bounding the duality gap. On the other hand, for any non-stationary round, we show 
that the potential increases substantially. This large increase in the potential then leads to the conclusion that 
there cannot be too many non-stationary rounds, thus bounding the overall convergence time. 

We make a few remarks here. First, we require that a be bounded away from zero. This requirement is 


without loss of generality because we show that when a < 


e/4 

ln(nAnia,x/e) 


, any e—approximation LP provides 


a 3e—approximate solution to (Pa) (Lemma 14.301 ). Thus, when a < 

e/4 


e/4 

ln(nyln,ax/£) 


we can switch to the 


algorithm of f7i|, and when a > /g) . the convergence time remains poly-logarithmic in the input 

size and polynomial in s~^. Second, the assumption that e < in the a < 1 case is also without 
loss of generality, because we show that when a is close to 1 (roughly, 1 — 0 (e^/ln^(i?p,mn^max/£)))> 
we can approximate (Pa) by switching to the a = 1 case of the algorithm (Lemma [4. 3 111 . Finally, when 
a > 1, the algorithm achieves an e—approximation in time ln^(i?^„nm^max£~^))■ We believe 

that a polynomial dependence on a is difficult to avoid in this setting, because by increasing a, the gradient 
of the a-fair utilities fa blows up on the interval ( 0 , 1 ): as a increases, fa(x) quickly starts approaching a 
step function that is equal to — oo on the interval (0,1] and equal to 0 on the interval (1, oo]. To characterize 
the behavior of a—fair allocations as a becomes large, we show that when a > e“^ln(i?„,nAmax)! all the 
coordinates of the a—fair vector are within a 1 ± e multiplicative factor of the corresponding coordinates of 
the max-min fair vector (Lemma [4.341) . 

Finally, we note that the main convergence argument from fT] that uses an appropriate definition of 
stationary intervals does not extend to our setting. The proof from Q “breaks” in the part that shows 
that the solution is e—approximate throughout any stationary interval, stated as Lemma 3.7 in f7l. The 
proof of Lemma 3.7 in 0 is by contradiction: assuming that the solution is not e—approximate, the proof 
proceeds by showing that at least one of the variables would increase in each round of the stationary interval, 
thus eventually making the solution infeasible and contradicting one of the preliminary lemmas. For a > 
1 , unlike the linear objective in 0 , a-fair objectives are negative, and the assumption that the solution 
is not e—approximate does not lead to any conclusive information. For a < 1, adapting the proof of 
Lemma 3.7 from 0 leads to the conclusion that for at least one j, in each round t of the stationary interval 

^ - — <1 — 7 , where x* is the optimal solution, and x* is the solution at round t. In 0, where 

a = 0 , this implies that Xj increases in each round of the stationary interval, while in our setting (a > 0 ) it 
is not possible to draw such a conclusion. 


Main Results. Our main results are summarized in the following three theorems. The objective is denoted 
by Pa(x), X* denotes the solution at the beginning of round t, and x* denotes the optimal solution. 

Theorem 4.1. (Convergence for a < 1) ck-FairPSolver solves (Pa) approximately for a <lin time that 
is polynomial in In particular, after at most 

O (a~‘^e~^ In^ (R^mnA^s^f) \t? (e~^ (4) 
rounds, there exists at least one round t such that pa(x*) — Pa(x^) < epa(x^). Moreover, the total number 
of rounds s in which Pa(x*) — Pa(x^) > epa(x^) is also bounded by d?]). 

Theorem 4.2. (Convergence for a = 1) a-FAiRPSOLVER solves (Pi) approximately in time that is poly¬ 
nomial in ln(i?^nmAmax)- In particular, after at most 

O (e~^ In^ (7?^nm^niax) In^ (e"^i?^nmAmax)) (5) 

rounds, there exists at least one round t such that p(x*) — p(x^) < sW. Moreover, the total number of 
rounds s in which p(x*) — p(x®) > eW is also bounded by (O. 

Theorem 4.3. (Convergence for a > 1) ck-FairPSolver solves (Pa) approximately for a > 1 in time that 
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is polynomial in e ^ ln(nmj4max)- particular, after at most: 

O In {RwnmAras.yi) In RwUmA^s.^) ( 6 ) 

rounds, there exists at least one round t such that Pa{x*) — Pa{x^) < ^i—Paix*^))- Moreover, the total 
number of rounds s in which Pa{x*) — Pa{x^) > ^i—Pa{x^)) ^ also bounded by dH). 

Feasibility and Approximate Complementary Slackness. The following three lemmas are preliminar¬ 
ies for the convergence time analysis. Lemma l4!4l shows that starting from a feasible solution, the algorithm 
always maintains a feasible solution. Lemma l43] shows that any violated constraint becomes feasible within 
poly-logarithmic number of rounds, and remains feasible forever after. Combined with Lemma |4^ Lemma 
14.51 allows us to focus only on the rounds with feasible solutions x. Lemma 14.61 shows that after a poly- 
logarithmic number of rounds, approximate complementary slackness (KKT condition (IK3I) ') holds in an 
aggregate sense: YT=i - l) ~ 0 . 

Lemma 4.4. If the algorithm starts from a feasible solution, then the algorithm maintains a feasible solution 
x: Xj > 0, Vj and Y2'j=i ^ij^j — each round. 

Proof By the statement of the lemma, the solution is feasible initially. From the way that the algorithm 
makes updates to the variables Xj, it is always true that Xj > 0, Vj. 

Now assume that x becomes infeasible in some round, and let x^ denote the (feasible) solution before 
that round, x^ denote the (infeasible) solution after the round. We have: 

n n 

< 1, Vi E {1,..., m}, and for some A; E {1,..., m}. 

t=i 1=1 

For this to be true, x must have increased over at least one coordinate j such that Aj^j / 0. For such a 
change to be triggered by the algorithm, it must also be true that: 

m 

yi{x^)Aij < wj (1 - 7) ■ 

i=l 

Since, by the choice of /3i = /3, this term can increase by a factor of at most 1 -|- 7 / 4 , it follows that: 

m 

(x])" - 7) (1 + J) < Wj. 

i=l 

This further implies: 

{x])°‘yk{x^)Akj < Wj, 

and since whenever A^j / 0 we also have A^j > 1 , we get: 

{x])'^yk{x^) < Wj. ( 7 ) 

On the other hand, since xj > 6 j, 5j^ = and Akjxj > 1: 

(xj)"r/fc(x^) >^-C- > Wj, 

which contradicts Q. □ 

Lemma 4.5. If for any i: ^ij^j > after at most ti = 0{-^ In(nAmax)) rounds, it is always 

true that ^ij^j — 

Proof. Suppose that 1 some i. Then yi > C, and for every Xj with Aij ft 0: 

m 

> Wj > Wj{l - 7 ), 

l=l 

and therefore, none of the variables that appear in i increases. 


Ai'X' 

Since y^”_i > 1, there exists at least one Xh with At ^ 0 such that Xh > ——■ 

^—'J — ^ Jo/ / Allien 

For each such Xk, since C > 2t(;max^^max' 

m ^ 

a^fc" y yi{x)Aij > C— -> 2rcmax > Wk{l + -f), 

^^max 

and therefore, x^ decreases (by a factor (1 — (32))- As Xk < 1, after at most 0(^ In(nAmax)) rounds in 
which > 1 , we must have Xk < —, and therefore, Yl]=i ^ij^j — 

Using the same arguments as in the proof of Lemma 1441 the constraint i never gets violated again. □ 

Lemma 4.6. If the algorithm starts from a feasible solution, then after at most tq = ^ In rounds, it 

is always true that: 


1. There exists at least one approximately tight constraint: max* | X]j=i ^ 1 ~ (1 + 

2- TIiLi yi < (1 + 3e) YTj=i Xj YT=i vAij, and 

3- (1 - 3e) E™ 1 y^ < TZi < ET=i m- 


Proof Suppose that maxj EEi < 1 — £• Then for each yi we have: 


yi<C- = U ■ 


ewr, 


ewr. 


CmA^s.^ mA^s.^' 

Due to Lemma l4~4l we have that x is feasible in every round, which implies that Xj < 1 Vy. This further 
gives: 


m 

X° y yiAij < Wj£ < Wj{l - 7), 
i=l 

and, therefore, all variables Xj increase by a factor 1 + /?. From Lemma 14.41 since the solution always 
remains feasible, none of the variables can increase to a value larger than 1. Therefore, after at most tq = 

logi_,_^ (t^) — (t^) there must exist at least one i such that Ey=i If in any 

round maxj Ey=i ^ij^j decreases, it can decrease by at most (32 Ej^=i Aijx j < (3 Ei=i ^ijXj < /3 < ^- 
Therefore, in every subsequent round 


n ^ 

AijXj > 1 - (^1 + —)e. 

i=i 

For the second part of the lemma, let S' = {i : Ej^=i < niax^gji^ „j} Ej^=i ^kjXj — be 

the set of constraints that are at least “^^e-looser” than the tightest constraint. Then for i G S we have 

/t—1 E ^/c E 

Vj < e 5 max vu < — e ' max vu < 1.2— max Vh. 

A:E{l,...,Tn.} TXl k£{l,...,7n} Tfl 


This further gives: 

m 

y yt = y yt + y yfc < (1 + i- 2 e) y y*. 

i=l i&S k^S iiS 

Moreover, for each i f: S we have pi Ey=i ^ij^j ^ (1 “ l-2e)yi, since for i ^ S: 


E 

i=i 


AijXj > max 

/cG{l,...,m} 


i-kjXj 


K — 1 

5k 


e > 1 - 


^ 1 K — 1 

5k 5k 


e = l- 1.2e. 
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Therefore: 


Y^vi < 

2=1 


1 + 1 . 2 £ 


1 - 1 . 2 £ 


n 

i^S j=l 


n 

< (1+ 3e) Yy^Y 

i^S j=l 


(from e < 1 / 6 ) 


< (1 + 3e) Yy^Y 

i=l j=l 

Interchanging the order of summation in the last line, we reach the desired inequality. 

The proof of the last part of the lemma follows from feasibility: AijXj < 1, Vi (Lemma [4.41) . and 

from > 1 — 3e. □ 

Lemmas analogous to l4.4l and l4.6l also appear in f7l|- However, the proofs of Lemmas 14 .4 1 and 14. 6 I req uire 
new ideas compared to the proofs of the corresponding lemmas in fT]. We need to be much more careful 
in our choice of lower thresholds 5j and constant C in the dual variables, particularly by choosing (7 as a 
function of several variables, rather than as a constant. The choice of 6j’s is also sensitive as smaller 6j’s 
would make the potential function range too large, while larger 6j’s would cause more frequent decrease of 
“small” variables. In either case, the convergence time would increase. 


Decrease of Small Variables. The following lemma is also needed for the convergence analysis. It 
shows that if some variable Xj decreases by less than a multiplicative factor (1 — /32), i.e., xj < and Xj 
decreases, then Xj must be part of at least one approximately tight constraint. This lemma will be used later 
to show that in any round the increase in the potential due to the decrease of “small” variables is dominated 
by the decrease of “large” variables (i.e., the variables that decrease by a multiplicative factor (1 — / 32 )). 
Lemma 4.7. Consider the rounds that happen after the initial ti = 0{-^ ln(n74max)) rounds. If in some 

round there is a variable Xj < 

that: Viix) > ^k=i ^ik^k >1-1- 


j ^ 1 -IB 2 decreases, then in the same round for some i with Aij 0 it holds 


Proof Suppose that some Xj < triggers a decrease over the coordinate. The first part of the Lemma 
is easy to show, simply by using the argument that at least one term of a summation must be higher than the 
average, i.e., there exists at least one i with Aij / 0 such that: 


Viix)Aij > 


ET=iAiM^) 


m 


Vi > 


'krtAynaX 


For the second part, as Xj < we have that: 


yi{x) > 


{i- hT Xj^YT=i^ijyiix) 


^-^max ^-^max 

Since Xj decreases, we have that Xj^ YllLi Viix) Aij > (I+ 7 ), and therefore yi{x) > 

Moreover, as yi{x) = C ■ and C = it follows that: 




mA„ 


(1+7)(1-/32)° 


( 8 ) 


Observe that for a < 1: 


(1 + 7 )(i - hr > (1+ 7 )(i - ft) > (1 + 9 (1 - 


> 1 > v/e, 


(9) 
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while for a > 1 , since ea < ^: 


(1 + 7)(1 - /52)“ > (1 + 7)(1 - a/32) > (1 + 7 ) (l - > 1 > \/e, (10) 

where we have used the generalized Bernoulli’s inequality for (1 — / 32 )“ > (1 — a/ 32 ) ll32l> and then 


/32 = /3 = 5 (^+a) < 5 • Recalling that k = - In 


1 , / CmA^ 


ewr, 


CmAr, 


V ^^min 

^ik^k 


, and combining ([Hi with (|9ll and ([TOll : 


> 


V~e 


mA. 


Finally, as C > 2w,riaxnmAraax, it follows that 
gives: 


max 


^min^ ^ et/min _ ^ 

C*77T.74max -^max 




mAn 


< 1 , which 


< i « ^ > 1 - 1 . 

k=l 


□ 


Potential. We use the following potential function to analyze the convergence time: 

^ 771 

^>(x) =Paix) -Vyi(a:), 

Kj 

i=l 

where Pa{x) = Wjfa{xj) and fa is defined by ([T|l- The potential function is strictly concave and its 
partial derivative with respect to any variable Xj is: 


c)<h(x) 

dxj 


_ Wj 


Xi 


'^yi{x)Aij 


Wi 


1 - 


Xj"" EI^i yiix)A, 




( 11 ) 


7 

The following fact (given in a similar form in fT|), which follows directly from the Taylor series repre¬ 
sentation of concave functions, will be useful for the potential increase analysis: 

Fact 4.8. For a differentiable concave function f : EA —)• M and any two points x^,x^ G M”; 

E - 4) > f(4 - f(4 2 E ^(d - d)- 


i=i 


i=i 


Using Fact l4.8l and ([TT]) . we show the following lemma: 

Lemma 4.9. Starting with a feasible solution and throughout the course of the algorithm, the potential 
function <h(x) never decreases. Letting x^ and x^ denote the values of x before and after a round update, 
respectively, the potential function increase is lower-bounded as: 


<h(x^) — <h(x°) > ^ 


Wi 


x]-x']\^ {x]YYflfiyi{x^)Ai 


7? 


7 I 


’ (d)' 

Proof Since <h is concave, using Fact 14. 8 1 and (fTTl) it follows that: 


w 


j 


4>(x^) — 4>(x°) > 


w 


x]-x^j 


j=l 


1 - 






Wi 


( 12 ) 


If Xj = x^j, then the term in the summation fY2\ corresponding to the change in Xj is equal to zero, and 
has no contribution to the sum in (Y2\ . 


If x) — > 0, then, as x, increases over the observed round, it must be 


(^°)“EI2iyiU°)A 


<1 — 7. 
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By the choice of the parameters 






and therefore 




^ (1 + T) (1 - 7) = 1 - 77 - ^ < 1 - 77- 


( 13 ) 


It follows that 1 — 




w 


rf 't _ /Vi 0 


ix]Y 


1 - 


> I 7 > 0 , and therefore 

ix]yJ2Ziyiix^)Aij 


= w-i 


Xj Xj I 


Wi 


(x])“ 


1 - 




Wi 


Finally, if x] — x^, < 0, then it must be 




> 1 + 7. By the choice of the parameters, 




7^ EI7i^d^°)A 


We get that 1 — 


>( 1 - 1 ) 
{^]rET=ly^i^^)A^J 

Wj 


, implying 


> (l-y)(l + 7) = H-y7-^>l + 77. 


1 


(14) 


{x])'^YT=iyY^^)^' 


3 


< — 47 < 0 , and therefore 
1 

I j/ 

= w 






w 


3 


(^]) 


1 - 




Wi 


□ 


' (x])“ \ 

completing the proof. 

4.1 Proof of Theorem 14.11 

The outline of the proof is as follows. We first derive a lower bound on the potential increase (Lemma [4. 101 ). 
which will motivate the definition of a stationary round. Then, for the appropriate definition of a stationary 
round we will first show that in any stationary round, solution is 0(e)—approximate. Then, to complete 
the proof, we will show in any non-stationary round there is a sufficiently large increase in the potential 
function, which, combined with the bounds on the potential value will yield the result. 

The following lemma lower-bounds the increase in the potential function in any round of the algorithm. 
Lemma 4.10. If a < 1 and <h(x°), x^, y{x^) and <h(x^), x^, y(x^) denote the values o/<h, x, and y before 
and after a round, respectively, and S~ = {j : Xj decreases}, then if x^ is feasible: 

1. ; 

2. <i>(rc‘) - > n(0) ((1 - 7) E].i a 7 ( 7 )'-” - ET.1 y.{x°) Ayi?),- 

3. 4 .(xl)-$(x»)>Si( 5 ;,j£-^) (EEl!/i(^“)E 7 l 

Proof 

Proof of 1. Observe that for j E S~, xj = m.ax{6j, (1 — /32)xj}. From the proof of Lemma |4~^ we have 
that: 


<h(x^) — 4>(x°) > 


f6S- 


(xj)“ [ Wj 


-1 


The proof that 

2...on1-« f (^j)'^EZiyi(x^)^i3 
A. ^Y^3> 1 

j&S- \ ^ 


- 1=0 


E 


Wj{Xj) 


07i-« (i^]rET=ly^i^^)A^J 


- 1 


Wi 
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is implied by the proof of part 3 of this lemma (see below). For eaeh j £ S , we have that: 


Wi 


-1 > (1 + 7)(1-7/4)-1>7/2, 


Therefore: 


BqX^- 

- 4>(X°) > fl(7) ^ 

/327 


= Q 


1-/32 


3' 

Y1 


j&S- 


Proof of 2. Let S~^ denote the set of j’s sueh that Xj inereases in the eurrent round. Then, reealling that 
for j £ S+ < 1 _ ^ and that from the ehoiee of parameters < 


(1 + 7/4) 




/y* 1 _ /y>Q 




1 - 


ix]rET=M^^)A^ 




3=1 


Wi 


w. 


x]-xO 


+) 


1 - 


ix]rET=ly^i^^)A^ 




Wi 


> 


> 


E 


w. 


x]^ 


.... ^ ("])“ 


x]-xO 


1 - (1 + 7/4) 




E 


w. 


{x]Y 


(1 - 7) - 




jes+ ' ^ 

Sinee j £ 5+, x) = (1 + /3)x'-, it follows that 


Wi 


4>(x^) - 4>(x°) > 


/? 




y; 10,(7)-” (1 - T) - 




(„0\aspm 

Observing that for any xj ^ 5+ we have that (1 — 7 )-^- - < *-*’ 


4.(x‘) - y^w,(7)'-“ (^(1 - 7) 


(7)"aiiw(o>")A 


Wi 


n m 


= n{(3) (1 - 7) E “ “ E E 

\ j=l j=l i=l 

Proof of 3. Let S~ denote the set of j’s sueh that Xj deereases in the eurrent round. In this ease not all the 
Xj’s with j £ S~ deerease by a multiplieative faetor (1 — / 32 ), sinee for j £ S~: xj = max{(l — I32)xj, (5j}. 
We will first lower-bound the potential inerease over Xj’s that deerease multiplieatively: {j : j £ S~ A 

+ _ ^ 0 (-\ _ T?,.^on tUot w 3 ^ Q-- (^j) > 1 , j 

HlA - ' ^ 


Xj(l — ( 32 ) > Sj}, so that xj = xj(l — / 32 )- Reeall that for j £ S : 

H)-E -7 vd^V. > _ 7 / 4 , > (1 _ It follows 


that: 


<F(x^) — $(x'^) > 


Y 2 


(1 - Y 2 Y 


E 


Wj{xj) 


0 ^ 1 —a 
3 ' 




V 


- 1 


Aa;°(l-/3)>+} 
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> /32 


E 






-1 


{j:j(^S Ax?(1-/32)><5j} 


Wi 


= n 


/32 


ln(l/5min) 


E 




ix^,rET=imix^)A. 




{j--j&S-Ax°{l-l32)>Sj} 


Wi 


-(1 + 7) • 


(15) 


Next, we prove that the potential increase due to decrease of Xj such that {j : j £ S A x^{l — (32) < 6j} 
is dominated by the potential increase due to x^’s that decrease multiplicatively by the factor (1-/32). 


Choose any Xj such that {j : j £ S A Xj{l — (32) < Sj}, and let ^j(x^) = 
Lemma 14771 there exists at least one i with Aij / 0, such that: 


07 _ (^°rEZiAiiyi(xO) 


. From 


2/i > 


Wj(xj)°' ^ 1 ^i(l-/ 32 )“.. 07 ^ 1-/32 Wj 


Wj{x^A°‘ 


mAn 


> 


mA 


> 


max 

n 


mA^so, (3/ 




^ ^ > 1 


and, (16) 
(17) 


k=l 


From (fTTl) . there exists at least one p such that Aip / 0 and 


ip 

1 - - 

Apxl > - 

^ n 


(18) 


Since x^ £ (0,1] and a £ (0,1), using (fT^ . we have that ^jp(xp)“ > AipX^ > Recalling (fT^ : 

m 


i=i 


> 


1 - § 1- h Wj 


n 


mA + 


^j(x°). 


Recalling that ^ = C > 2r(;max?^^?7i^max> it further follows that: 


m 

> 2 (l - I) (1 - ( 32 ) • n - Wmax •^i(a;°). 

Z=1 


(19) 


Because e < | and (32 < (3 = it follows that 2 (l — |) (1 — (32) > 1. Therefore: 

{x^p)^ET=iAipyi{x^) ^ (x0)“EI^iApyz(x0) 


> n ■ ^j{x^) = n 


^20) 


Wp rUmax Wj 

As a < 1, we have that (5j“ > Sj, and ^ ^ = C. Similar to (fT^ . we can lower-bound yi as: 

„0 X^ A 1 o .V.0 ' 


,^^1-/32 Wj x'jY.iyi{x)Aij I -(32 Wj x'jY,iyi{x)Aij 
yi{x) > —:-^ • — -> 


Then, recalling AipX^ > 


mAj^ 

1-1 


ax Wj TTlAmax 

and using (|2TI) . it is simple to show that: 


Wi 


( 21 ) 


Xp'^yiix^)Aip > n ■ x^j'^Aijyi{x^). (22) 

i 1=1 

As ij{x^) > (1 + 7 ) and x® > it immediately follows from (l20l) that Xp decreases by a factor (1 — (32)- 
In the rest of the proof we show that (l20l) and (1221) imply that the increase in the potential due to the 
decrease of variable Xp dominates the increase in the potential due to the decrease of variable Xj by at least 
a factor n. This result then further implies that the increase in the potential due to the decrease of variable 
Xp dominates the increase in the potential due to the decrease of all small x^’s that appear in the constraint 
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i (rcfc’s are such that Aik / 0, and >1 + 7 )- 

Consider the following two cases: Wp{x^)^~^ > (rujx'-)^"" and Wp{x^)^~^ < (wjX^y~‘^. 


Upyuypj ^ yujjUyj ^ 

Case 1: Wp(xpy~°‘ > {wjX^y~^. Then, using (1^ : 




Wp{Xp) 


0\l-a 


i4rET=iAipyiix^) . o,i-« 

- (1 + 7 ) > [WjxA 


\4rET=iAipyii^^) 


W„ 


Wr. 


- (1 +7) 


> {WjX^ 


0\1—a 


n 


> n ■ {wjXj) 


0\l-a 




(23) 


Case 2: Wp{xy)^ " < {wjX^A^ Then, using (1221) : 


_ (1 +^)') = - (1 + 

\ '^P / ;_1 


> n • (wjxp 

Combining (|2^ and (l24l) with (fTSl) . it follows that: 


z=i 

m 

> Xp'^Aipyi(x°) - (1 + 7)mj(x°)^"“ 

1 = 1 

m 

> n • x° ^ Aijyiix^) - (1 + 7)tUj(x°)^- 


z=i 


0\l-a 


Wi 


- (1 +7) 


(24) 


Finally, since for j ^ S : 


$(x^) - 4>(x°) > n{P2) Y1 

j&S- 


i^T^T=ly^i^yA^J 


Wi 


- (1 +7) 




4 >(x^) - $(x°) > 52(/32) ^mj(x°)i- 


-(l+7)j <0: 




= 0 


i=i 

/32 


tUi 


- (1 + 7) 


n m 


ln(l/5min) ' 


'^x]^yi{x^)Aij - {1 + j)J2wjix°y~ 

j=i i=i j=i 


completing the proof. 


□ 


Parts 2 and 3 of Lemma f7l| appear in a somewhat similar form in fTl. However, part 3 requires significant 
additional results for bounding the potential change due to decrease of small x/s (i.e., x/s that are smaller 
than pr^) that were not needed in fT]. The rest of the results in this paper are new. 

Consider the following definition of a stationary round: 

Definition 4.11. (Stationary round.) Let S~ = {j : Xj decreases}. A round is stationary if it happens after 
the initial tq + ri rounds, where tq = ^ ln(j+—) and xi = ^ In(n^niax). tind both of the following two 
conditions hold: 


1- EjeS-^j^i “<7Ej=iW'i®i ‘^,and 
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2. e;=i TZi < (1 + 57/4) E •=! 

In the rest of the proof, we first show that in any stationary round, we have an 0(e)—approximate 
solution, while in any non-stationary round, the potential function increases substantially. 

We first prove the following lemma, which we will then be used in bounding the duality gap. 

Lemma 4.12. After the initial tq + ri rounds, where tq = ^ ~ ^ lii(n 74 max), in each 

round of the algorithm: ^j{x) = ^ \/j_ 

Proof Suppose without loss of generality that the algorithm starts with a feasible solution. This assumption 
is w.l.o.g. because, from Lemma 1431 after at most ti rounds the algorithm reaches a feasible solution, and 
from Lemma l4~4l once the algorithm reaches a feasible solution, it always maintains a feasible solution. 

Choose any j. Using the same argument as in the proof of Lemma l4~4l after at most ^ lii(^) < tq 
rounds, there exists at least one round in which ^j{x) >1 — 7 (otherwise Xj > 1 , which is a contradiction). 

Observe that in any round for which ij{x) <1 — 7 , Xj increases by a factor 1 + /3i = 1 + /3. Therefore, 
the maximum number of consecutive rounds in which ij{x) < 1 — 7 is at most ^ In(^) < tq, otherwise 
Xj would increase to a value larger than 1, making x infeasible, which is a contradiction due to Lemma l4~4l 
The maximum amount by which ij{x) can decrease in any round is bounded by a factor 1 — E \n(ifs ) ~ 
1 — 4 . Therefore, using the generalized Bernoulli’s inequality, it follows that in any round: 

(i_2.i)->(,_,).(,_2)>,.52. 

□ 


A simple corollary of Lemma l4.12l is that: 

Corollary 4.13. After the initial tq + ri rounds, where tq = ^ ~ ^ lii(nAmax). in each 

round of the algorithm: Ei yi(x)Aij > (l — x) Ej 

Proof From Lemma l4.121 after the initial tq + ti rounds, it always holds ij{x) = 

Vj. Multiplying both sides of the inequality by WjXj^~^, Vj and summing over j, the result follows. □ 

Recall that Pa{x) = Ej 'nJjfa{xj) denotes the primal objective. The following lemma states that any 
stationary round holds an (1 + 6 e)-approximate solution. 

Lemma 4.14. In any stationary round: p{x*) < (1 + 6 e)p(x), where x* is the optimal solution to {Pa)- 
Proof Since, by definition, a stationary round can only happen after the initial tq + ti rounds, we have that 
X in that round is feasible, and also from Lemma 1431 Ei Vi ^ (1 + 3e) Ej Ei yi{x)A.ij- Therefore, 

recalling Eq. (12]l for the duality gap and denoting ^ ^ we have that: 


1 —« / l-a 

a 

j 


p{x*)-p{x) < G{x,y{x)) = 

j ^ 

^ X 
= 2^^^ I 


< 


\—A vL 


— a 

l—a 


— a 


1 j + yi(a^) - WjXj^ ""ij “ 

'' i j 

oiij ~ - ij +Y^yi{x) 

^ i 

a^j “ — + (1 + 3e) yi{x)Aij. (25) 


From Lemma l4.12[ > 1 — VJ. Partition the indices of all the variables as follows: 


57 67 


•S'! — s J : ^7 C 1 F > ^ “I F f ’ <52 — J : ^7 > 1 H j- 


57 


Then, using 


p(x*) - p(x) < Gi(x) + G 2 (x), 
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where: 


Gilx) = ^ Wj ^ “ - I'j + (1 + 3e) ^ Xj ^ yi{x)Aij 

jeSi “ ^ ^ jeSi i 

and 

G2 (x) = ^ ^ (a^j “ - I'j + (1 + 3e) ^ Xj 

jeS2 ^ ^ jeS2 i 

The rest of the proof follows by upper-bounding Gi{x) and G2(x). 

Bounding Gi{x). Observing that Vy: xj yi{x)Aij = WjXj^~°'^j, we can write Gi(x) as: 

G'i(x) = ^ “ + (1 + 3e)(l - - iV (26) 

feSi “ ^ ' 


Denote r{^j) = “ + (1 + 3e)(l — a)^j — 1. It is simple to verify that r{^j) is a convex function. 

Since G ^1 — ^,1 + Vj G Si, it follows that r{^j) < max{r(l — Sy/d), r(l + Sy/d)}. Now: 


r(l — 57/d) = ail — 


57 


+ (1 — q:)( 1 + 3e) (1 ~ 


57 


- 1 


r . 1 —a 


< Q:|^l ——j + (1 — a){l + 3e) — 1. 

If < 1, then as (1 — 57 /d)“^ < (1 + 27 ), it follows that (1 — 57 /d) ^ < 1 + 27 . Therefore: 

r(l — 57 /d) < a{l + 27 ) + (1 — a)(l + 3e) — 1 

= 270 ; + 3 • (1 — a)s = a^ + 3 • (1 — a)e 


= 3e ^1 — -a 

If > 1, then (using generalized Bernoulli’s inequality and e < 

r(l - 57/d) < a - - -+ (1 - «)(! + 3e) - 1 


(27) 


< a 


1 - 

< afl + 


(1 - 57/d) — 

+ (1 - «)(! + 3e) - 1 


< (1 - a 


< de(l — a). 


4 a 
57 1 — a 

d a 

5? + 3 . 


^ + (1 — Q :)(1 + Se ) — 1 


(28) 


On the other hand: 

r(l + 57) = a(^l + ^) “ + (1 - a)(l + 3e) (^1 + - 1 

7 Q: “h (1 — ^)(i- “h 4^) — 1 

= de(l-a). (29) 

Combining (l27]) -(l29l) with (l26l) : 

Gi(x) < de • ^ Wj . (30) 

iesi “ 

Bounding G2(x). Because the round is stationary and S 2 C S~, we have that: — 
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7 Using the second part of the stationary round definition and that YljeS 2 Si=i > 

(1 — 57 / 4 ) X]je52 (follows from Lemma l4.12l) : 


m m 


yi{x)Aij = '^Xk'^yi{x)Aik - xi'^yi{x)Aik 

jeS2 i=l fc=l i=l 1^82 «=i 


< (1 + 57 / 4 ) ^ WkXk^ " - (1 - 57 / 4 ) ^ WlXl^~ 

k=l 1^82 


< (1 + 57/4) Wjx/ “ + Y X] " 

i6S2 1^82 

n _ n 

< 7(1 + 57/4) Y ^ X] '^kXk^~ 


k=l 


k=l 


<4'yY'^kXk^ °‘ = eY'<^kXk^ “• 


(31) 


k=l 

Above, first inequality follows from Y1 r=i ^k E ili yi{x)Aik < (1 + 57 / 4 ) YJk=i WkXk^ “ (part 2 of the 
stationary round definition) and Corollary 14.131 Second inequality follows by breaking the left summation 
into two summations: those with j G S2 and those with I ^ 82- The third inequality follows from 82^8 
and part 1 of the stationary round definition. 

_ 1 — a 

Observe that as > 1 + 67/4 > 1, we have that e; “ <1. Using (|3TI ). it follows that: 

G2{x) = Y '^3 ^ “ - 1^ + (1 + 3e) ^ Xj Y yi{x)Aij 


k=l 


3&S2 


< (a - 1) X] 


X 


=3 
1—0 


i6S2 * 


Wi 


+ (1 + 3e) yi{x)Ai 


< (« - 1) E 


^ 1 — a 

3&S2 3&S2 i 

1—0 


W 


3 &S 2 


1 — a 


+ e{l + 3£)Y^kXk^~ 


k=l 


3 


^ 1 — CK 

Xk 


< -(1 - a) ^ + 2^^^ “ «) E^^I 


j&S2 


k=l 


— a 


Xk 


< - o^) Z^Wkj 


1—a 


< 


r, Xk 

2£Y'^kY 


k=l 

1-0 


— a 


k=l 


— a 


(32) 


Finally, combining (l30l) and (l32l) : 

p(x*) — p{x) < ^4e + 2e^ Y^ 


l—a 


feSi 


— a 


= 6£:p(a:). 


□ 


Proof of Theorem 14. 11 From Lemma 14.141 in any stationary round: p{x*) < p{x){l + 6e). Therefore, to 
prove the theorem, it suffices to show that there are at most O In^ {RyjmnA^a,^) In^ 
non-stationary rounds in total, where = mmax/w^min> because we can always run the algorithm for 
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e' = e/6 to get an e—approximation, and this would only affect the constant in the convergence time. 

To bound the number of non-stationary rounds, we will show that the potential increases by a “large 
enough” multiplicative value in all the non-stationary rounds in which the potential is not too “small”. For 
the non-stationary rounds in which the value of the potential is “small”, we show that the potential increases 
by a large enough value so that there can be only few such rounds. 

In the rest of the proof, we assume that the initial tq -|- n rounds have passed, so that x is feasible, and 
the statement of Lemma 14.61 holds. This does not affect the overall bound on the convergence time, as 

To + ri = 4 In f + 4- \ii{nAmax) = O In(nAmax) In () 

P ^ ^min ^ h 2 \P ^ ^min ^ / 


= o —J In(nAmax) In^ R; 


ae 


mnA„ 


In {R^mnA^ 


(33) 


^ .1 — 0. 

To bound the minimum and the maximum values of the potential <h, we will bound ^ wj and 


^ E* Viix). Recall that ^>(x) = V . ^ E* Viix). 


Since 5j = 


that: 


and 


2win3.xn‘‘mA 

max 

W 


— 1 

n 


1 — a 


w„ 


‘, X is always feasible, and xj < 1, Vy, we have 
W 


< 


Thus, we have: 


2'lUmaxn^?7lj4iuax 

0 < - Vyi(x) < 

K < ^ 


^min ^ ^ 

kC 


E' 


1 — 0 


< 


3 

Cm 

K 


1 — a 1 — a 


(34) 


(35) 


> - - m ■ C 

K 

> -0(m^n^Amax^t'max), 


and 


^max E 


i=i 


— a 


W 

1 — a 


(36) 


(37) 


Recall from Lemma lT^ that the potential never decreases. We consider the following three cases for the 
value of the potential: 

Case 1: <l>min < ‘h < Since in this case < 0, we have that yi{x) > k Ylj . From 

Lernmaim Ylj Xj Ei yi{x)Aij > (1 - 3e) , thus implying: 


E E ^ E 


.1 — 0 


Wi 


1 — a 


> 


^■E 


Wi 


1-0 


1 — a 


(38) 


3 I- 3 3 

as K > - and e < Combining Part 3 of Lemma l4.10l and (IMl) . the potential increases by at least: 


VL 




f- 


Vln(l/<5n 


= ( 




Vln(l/5min) 
2 


= n 


7 


Kln(l/5n 


k] (-$(x)) 


(-<h(x)). 
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Since the potential never decreases, there can be at most 

^ / I 1 


O J In 


ri 


= o 


Case 1 rounds. 


a e 

1 —a 


In^ In ( R 


nmAr, 


Case 2: —0( ) < $ < 0( • (75 -“ ). From Lemma 1431 there exists at least one i 

^ -^max ^ \ 1 a; \ .^lUmax'^ ^-^max ^ J 

such that Y^j ^ijXj ^ 1 — (1 + Since Aij < Vi, j, it is also true that Yj ^ 

and as Xj^~^ > Xj and k > it follows that Yj WjXj^~^ > (1 — ^(1 + ^)) 


'^min 

-J - 

-^max 


. From 


1-(1 + 1/k)£ 

^max ^ 

we also 


have E,- 


W_ ^ / _ t^rnin 

a V 2wrnaxn^mA 

n 


E 


1 —Q 


Wn 


1 — a 


> max < (1 — e(l + e)) 


. Therefore: 

1 Wmin W 


1 — a A„ 


Wr, 


1 — a V 2wm^^'rR'rnAr, 


If T> < • max (1 - eil + e))-r^ • • I y , 

— 10 I ^ 1 ' -'''1—a Amax ’ 1 —a V 2uimax)T. mAmax 


, then 




(39) 


> —K • max (1 - e(l + e))----, --- 

10 I 1 Cr ^rna.x 1 Gi 


From Lemma 1431 

'^yiiA)'^AijXj > (1 - 3e) ^yj(x) 




2wm»^in?mA 


^ q\ 9 J/.. ^ ll’min [ ll^min 

UO V ^’l-a ^max I-a V^lOmaxn^mAmax 


From the third part of Lemma ld. 101 the potential increases additively by at least 








Vln(l/<5„ 


1 Wr, 


w 


max • 


1 - a ^max ’ 1 - a V 2w ^»^ Ti?mAm«^ 


Wr, 


and, therefore, $ = fl ^ • ( 


_ ^rriin 

^Suimax^^mA 

max 


O / ln(l/(i^in)lC 

\ 7^ / \a e 


) “ j after at most 
1 1 


2 j — 3 {RwWfnAmax) In j Rj 


nmAr, 


rounds. 


Case 3: ^ • ( 


Wrn\r, 

, 2uiinaxn^mA max 


) “ ^ < <h < In this case, 4> = O (YIj '^ Al-a ) • round is 
stationary, then from Lemma l4.14[ p{x*) < (1 + 6 e)p(x). If the round is not stationary, then from Definition 
14.1 11 either: 


1 - Hk&s-'^kXk > lYTj=iWjXj^ “,or 


2- e;=i TT=1 >(1 + ^)EJ=1 


1 — OL 


If the former is true, then using the first part of Lemma l4. 101 the potential increases by at least D ^ ^ 

4-“ ^ 


0^'r 


Ylj''XjXj^ " = D ) j-(l—Q:)4*. If the latter is true, from the third part of Lemma l4. 101 the potential 

increases by at least D ^ in(i//. ) ) ' Ej ^ ( in(i// ) ) ' “ Q:)<h. It follows that there are at 
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most 


O I In 


JiL 

1—Q 


1 — a 1 ty / w^in \ - 

= O In^ {Rw ■ mnA^^^) In^ ^ 

non-stationary Case 3 rounds. 

Combining the three cases with the bound on tq + ri (l3^ . the total convergence time is at most: 
O ( -^-ir In^ (i?u,mn^max) In^ iRw ■ 




e 


rounds, as claimed. 

4.2 Proof of Theorem 14.21 


□ 


The proof outline for the convergence of a-FAiRPSOLVER in the a = 1 case is as follows. First, we show 
that in any round it cannot be the case that only “small” x/s (i.e., Xj’s that are smaller than decrease. 
In fact, we show that the increase in the potential due to updates of “small” variables is dominated by the 
increase in the potential due to those variables that decrease multiplicatively by a factor (1-/32) = (1 — /3) 
(Lemmas 14.151 and 14.161) . We then define a stationary round and show that: (i) in any non-stationary round 
the potential increases significantly, and (ii) in any stationary round, the solution x at the beginning of the 
round provides an additive 5VFe-approximation to the optimum objective value. 

Lemma 4.15. Starting with a feasible solution, in any round of the algorithm: 


1- Xk TJlLi Vi{x)Aik > ^ Ejes- Xj TT=i yi{x)Aij. 

'y T!,;—1 \ y y-' ‘^j 1 

■ ^{keS-:Xk>^} Wk — 2 ^j&S- Wj 

Proof Fix any round, and let , y{x^) and x^,y{x^) denote the values of x, y at the beginning and at the 
end of the round, respectively. If for all j G S~ x° > there is nothing to prove. 

Suppose that there exists some xO < that decreases. Then from Lemma 147/1 there exists at least one 
i G {1,..., m] such that Aij / 0, and: 

• > l-f,and 


yi{x) > 


^riAmax 


>(1-/3) 


1 


5y mAn 




Since EEi ^ikX^k > ^ “ f > there exists at least one p such that AipX^ > Recalling that C 
2w^a^n‘^mA 

max- 


__1 '> 
<5i - 


(1-/3) 1-i x^^ET=Mx^)Aiy 


{x )Aipyi{x )>C ■ 

mA 


n 


> 2wrr,.^r?mAr, 


(1-/3) 1-| x)YT=iyi(xAAij 


mA 

> 2nmmax(l - /3) (1 - 


n 

e\ x^^YA=iyi{x^)Aij 


> nw 


x^,TT=iyi{x^)Aiy 


^ /fcu/max 


Wj 


(40) 
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Since Xj decreases, it must be >1 + 7- Using (l40l) : 




0 sr^m 


— - > — - - > n— ->1 + 7, 


Wr, 


Wr, 


Wi 


and, therefore, Xp decreases as well. Moreover, since (l40l) implies 


\'^yi{x^)Ai 


p > 


E 


w 


max 0 
X 


1=1 


Wi 


''^yi{x^)Ai 


'3 — 


E 


''^yi{x^)Aij, 


1=1 


{jeS :Xj<j^AAiji^O} {j&S :Xj<j^AAij^0} 

the proof of the first part of the lemma follows. The second part follows from (l40l) as well, since: 


> 


Wr. 


> n 


U^majc 

4T.T=iyii^^)Ai, 

Wi 


which, given that Xj was chosen arbitrarily, implies: 


4T:T=iyii^^)Aip 


> 


Wr 


E 


U'eS :Xfe< j+^AAife^O} 


4ET=iyii^^)Aij 

Wk 


□ 

Lemma 4.16. Let x®, y{x^) and x^,y{x^) denote the values ofx, y at the beginning and at the end of any 
fixed round, respectively. If x^ is feasible, then the potential increase in the round is at least: 

1 . T>(xi) - ^>(x°) > U(^7) EjeS+ ^+' 

2. ci,(a;i) - T.(xO) > U(/3) ((1 - 7)1+ - E]= 1 4 ET=1 y^ix°)A^J). 

3. $(xi) - <h(xO) > U(/3) (e;=i xO ET=i y^i^°)A^j - (1 + ^ 


Proof 

Proof of[T} Recall that: 


Xa 


]-4(. 4TT=M^^)A 


Wi 


i=i 

Lete,(xi) = 


> 


E 


x 


Wi 


)~4(. 4^T=iyi{x^)^ 


^ x^- 

j&S+ ■? 


1 - 




Wi 


If j E , then xj = (1 + /S)xj' and (x°) < 1 — 7 . Since from the choice of parameters increases by 
at most a factor of 1 + 7 / 4 , it follows that: ^j(x^) < (1— 7 )(l+ 7 / 4 ) < 1 — | 7 , which gives l—^j(x^) > I 7 . 
Therefore: 


4>(x^) — <h(x‘^) > 


/3 


1 + /3 4 


7 


E ^r 


3 &S+ 


Proof of |2l The proof is equivalent to the proof of the second part of Lemma 14.101 and is omitted. 

q y-ix^^A - 

Proof of|3l Using that for j E S~ we have that —^^+2- — >1 + 7 and xj = max{(l — /3)xj, djj. 


22 






















we can lower bound the increase in the potential as: 


Wi 


rf‘ 1 _ 


0 3,1 

{i65-:x0>^} ^ 




W 


J 


13 


1-/3 


E 


Wi 


x]ET=ly^i^^)A^J 


- 1 


Wi 


{jes-.x°>j 4 g} 


E ^i((i- 7 / 4 ) 


1-/3 




- 1 


ru,- 


{j6S-:xO>^} 




rui 




- (1 + 7 


(41) 


_/3v* w*. ^ 

{jes 

Now consider A; G S~ such that From the proof of Lemma 14.151 for each such Xk there 

exists a constraint i and a variable Xp > with p £ S~ such that / 0, A^p^o^ Yli yii^^)^ip ^ 

^ Ez yi{x°)Aik, and > n ■ . If Wk < Wp then 

(xlY.iyi{x^)Aip n , ^ ( xlJ2iyiix^)Akp ,, , 

wp[ ---(l + 7)j>n;.(n---(l + 7)j 


Wk 

pEl9i( ) Ip (1+ 7 )) = (7E!''<“^°)'^'p “ 

I 

> yi{x^)Akp - (1 + 7)n^Zc 

i 


On the other hand, if Wk > Wp, then: 


Wr. 


pp 

0 X^ 


Wr, 


It follows from (1441) that: 


$(,i) _ $(,») > ^ 


1-/3 2 ^ 

J6S 


OY- ^.0 

Wk 


- (1 + 7 


Finally, since for j ^ S we have that 




<1 + 7 : 


<h(x^) — <F(x°) > 7-^-^-— 7 E ^ w 


x^ET=ly^i^^)A^, 


1-/3 2 

n 

(E E yi(^^)Aij - (1 + 7) ^ 


- (1 + 7 


i=i 

n m 


W. 


j=l 2=1 


3=1 


□ 


Consider the following definition of a stationary round: 

Definition 4.17. A round is stationary if it happens after the initial tq+ti rounds, where tq = ^ ln(l/(5min). 
Ti = ^ In(nAmax) tind if both of the following conditions hold: 
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• Ejes+ ^ W^o; 

• (1 - 2j)W < Xj E™ 1 y(x)Aij < (1 + 2-f)W. 


We first show that in any non-stationary round there is a sufficient progress towards the e—approximate 
solution. 

Lemma 4.18. In any non-stationary round the potential function increases by at least ■ WIrf). 

Proof. A round is non-stationary if either of the two conditions from Definition l4.17l does not hold. If the 
first condition does not hold, then from the first part of Lemma l4.161 the potential increases by -W/ tq) . 

If the second condition does not hold, then from either the second or the third part of Lemma 14.161 the 
potential increases by at least Q{l3'yW) > D(/37 • VL/ro). □ 


Before proving that in every non-stationary round, the solution is 0(e)—approximate, we will need the 
following intermediary lemma. 

Lemma 4.19. Starting with a feasible solution and after at most tq = ^ In rounds, in any round of 

the algorithm: 


min SSlidffk 


J Wj 

Proof First, we claim that after the algorithm reaches a feasible solution it takes at most tq -|- 1 additional 
rounds for each agent j to reach a round in which > x _ ^ Suppose not, and pick any 

agent k for which in each of the tq -|- 1 rounds following the first round that holds a feasible solution: 

<1 — 7- Then Xk increases in each of the rounds and after ^ In(^) < tq rounds we have 
> 1- Therefore, after at most tq -|- 1 rounds the solution becomes infeasible, which is a contradiction 
(due to Lemma l4^ . 

X - v {x)A- 

Now choose any Xj and observe ^ over the rounds that happen after the first tq -|- 1 

rounds. The maximum number of consecutive rounds for which < 1 — 7 is rj = ^ln(^) < tq, 
otherwise we would have Xj > 1, a contradiction. Since in any round, due to the choice of the algorithm 
parameters, decreases by at most a factor of 1 — 7 / 4 , the minimum value that can take is at least 
(1 — 7)(1 — 7 / 4 )'^^'/^ > (1 — 7 )^“, thus completing the proof. □ 


Now we are ready to prove that a solution in a stationary round is 0(e)—approximate. 
Lemma 4.20. In any stationary round: pi{x*) — pi{x) < SeW, where x* is the optimal solution. 


Proof. Since, due to Definition 14.171 a stationary round can only happen after the initial tq -|- n rounds, we 
have that in any stationary round the solution is feasible (Lemmas 14.41 and 14.51) and approximate comple¬ 
mentary slackness fLemma l4~6l) holds. 

Recall the expression for the duality gap: 


Wi 


Gi{x,y) = -'^Wj In 

j=i \ “'j / 

From the second part of Lemma 1431 


Xj TT=i yAij 


+ Y^yi-W. 


n m 


y* < (1 +3e) Xj yi^ij- 

j=l i=l 


2=1 


Therefore: 


Gi{x,y) < -E Wj In 

j=i 


xj ET=i mA. 

Wi 




n m 


(1 -h 3e) yiAij - W. 

j=l i=l 
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Since the round is stationary, we have that =1 YT=i < (1 + 27)VF, which gives: 


Gi{x,y) < -E Wj In 
i=i 


Xj TT=i 


Wi 


+ 4eW. 


( 42 ) 


Let . The remaining part of the proof is to bound — X]j=i ^ lii(O) 

For G (1 — 7, 1), we have that —wj In(^j) < jwj. To bound the remaining terms, we will use Lemma 
I4.19l and the bound of the sum of the weights wj for which G S~^ (that is, < 1 — 7). It follows that: 

n 

- ^ Wj In(^j) < - ^ Wk In(^fc) -'^Wi In(^z) 

j = l fc:€i;e(l-7,l) 1&S+ 

< 7 Wk — In ^(1 — 7)^°^ • wi (from Lemma l4.19p 

fc:^fee(l-7,l) 1&S+ 

W 

< 7VF + ro7 • — 

To 

= 27iy 

(43) 

Combining (|4^ and (1431) . and recalling that pi(x*) — pi(x) < Gi{x, y{x)), the result follows. □ 

Proof of Theorem 14.21 Consider the values of the potential in the rounds following the initial tq+ti rounds, 
where tq = ^ln(l/5min), ri = ^ ln(n74max) (so that the solution x is feasible in each round and the 

approximate complementary slackness holds). Observe that tq + ti = {nmAms.^R^)\n ( -^ 

We start by bounding the minimum and the maximum values that the potential can take. Recall (from 
Lemma l4~9l ) that the potential never decreases. 

Due to Lemma l4~4l xj G [6j, 1], Vj, and therefore we can bound the two summations in the potential as: 

Wj ln{xj) > ^ Wj ln{6j) = -0[w ■ In f nmAmax")(44) 

i j 


= lw. 


Wr, 


Wj ln(a:j) < Wj ln(l) < 0, 


rE^.( 


x) > —- 


mC 


K 


> -mC = -0{wuia.^n^rn‘‘A^s,^) 


and 


mC 


K K 

i 


• < 0 . 


From (1441) and (l46l ): 

*^111111 ^ O^WYnsaR TTl j4jnax)- 

On the other hand, from (|45] ) and (1471 ): 


(45) 

(46) 


(47) 

(48) 


4'max < 0. (49) 

Consider the following two cases: 

Case 1: > W ^^nm^max). Then < -4>(x) < and 

K ?/* (*) — From the third part of Lemma l431 we have that Yj Yiyi{x)Aij > (l-3e) YiViix) > 
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2W. Thus using the Part l2l of Lemma |4. 16 1 we get that the potential increases by 


^(/3) = O I /3 • j = 0(/3 k) • (-^>(x)). 


Finally, since /3k = 6 ( 7 ), there can be at most O 


iln 

7 


Ri i] nmAn 


W\n(R 

W nniTtmax) 


Case 1 rounds. 


Case 2 


: i EiViix) < FF • In (e • ^nmTlmax)- 


Then — 2FF • In 


(e • < ^>(x) < 0 . 


From Lemma 14.201 if a round is stationary, then p{x*) — p{x) < 5eFF. If a round is non-stationary, from 
Lemma l4. 181 the potential increases (additively) by at least n (/37 • W/tq). Therefore, the maximum number 
of non-stationary rounds is at most: 


O 


W ln(nmAmax'Wmax/^min) 
I3iW/to 


= O 


= o 


1 


• In^ (R^nmA^ 


In^ {Ry^nmA 


J In' {R. 


nmAn 


0 ' 


Combining the results for the Case 1 and Case 2, the theorem follows by invoking a-FAiRPSOLVER for the 
approximation parameter e' = e/5. □ 

4.3 Proof of Theorem 14.31 


The outline of the proof of Theorem 14.31 is as follows. First, we show that in any round of the algorithm 
the variables that decrease by a multiplicative factor (1 — / 32 ) dominate the potential increase due to all the 
variables that decrease (Lemma r4.211 ). This result is then used in Lemma l4.221 to show the appropriate lower 
bound on the potential increase. Observe that for a > 1 the objective function Pa{x), and, consequently, 
the potential function <1> (x) is negative for any feasible x. To yield a poly-logarithmic convergence time in 
Ry;,m, n, and Alnax^ the idea is to show that the negative potential —$(x) decreases by some multiplicative 
factor whenever x is not a “good” approximation to x* - the optimal solution to (Pa)- This idea, combined 
with the fact that the potential never decreases (and therefore — d>(x) never increases) and with upper and 
lower bounds on the potential then leads to the desired convergence time. 

Lemma 4.21. In any round of the algorithm in which the solution x^ at the beginning of the round is 
feasible: 

m ^ m 

^ yiix^)Aij 

and 

E f E - ( 1 +) 2 5 E 

Proof If Xj > jfp, Vj, there is nothing to prove, so assume that there exists at least one j with < T^- 
The proof proceeds as follows. First, we show that for each j for which Xj decreases by a factor less than 
(1 — /3) there exists at least one Xp that appears in at least one constraint i in which Xj appears and decreases 
by a factor (1 — /?). We then proceed to show that Xp is in fact such that 

m m 

Xp Y = ^in)x^j Y yiix^)^ij 
1=1 1=1 


i=l 
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and 


Xp'^yiix^)Aip-{^ + 'y)wpiXp)^ " = ^7(n) ( - (1+ 7)wj(x°)^- 


i=i 


i=i 
0 V^m 


This will then imply that the terms Vi{x^)^ip yi{x^)^ip~{^~^l)wp{XpY “ dominate 

the sum of all the terms corresponding to Xj’s with Aij / 0 and Xj < thus completing the proof. 

From Lemma 1471 for each j E S~ with Xj < there exists at least one constraint i such that: 

• ELi ^ikxl > 1 - f, and 


^ K( 2 ^") > (1 - f>)‘ 


Therefore, there exists at least one Xp with Aip / 0 such that AipX^l > —which further gives Aip{x^)'^ > 




•A 




W-^p ^ n 


pt 


^ _ . A 

ip ^ ^max 


1—a 


where the last inequality follows from 1 < Aj„ < Amax and a > 1. 


Combining the inequality for Aip{xY)'^ with the inequality for yi{x^) above: 

m 

{xirY.yi{x^)^ip>{xirA,py,{x^) 

(i-ir 


i=i 


> 


•A, 


l—a 




Y-YY 


1 w, {xTET=iyiix^)Aij 


m A A" 
;/4^max Uj 


Wi 


(1 - i)' 




— - (from C = ^) 


n a^a|, e\» ix^j)'"Etiyiix°)^lj ,, a +1 , 2 a-l^ 

> 2nr(;max(l -/3) 1 - - — - (from C > 2r(;maxra“+ "r^max )• 


Wi 


Using the generalized Bernoulli’s inequality: (l — |)“ > 1 — ^ and (1 — /?)" > (1 — /3a) OTl . and 
recalling that ea<^, /3<^ = |^< we further get: 

9 


iXpT'^yi{x^)Alp > ‘inWraa.y, ( 1 


l=l 


10 • 120 


1 - 


9\ {x^rTZMx^)Ai, 


20 


Wi 


> nw 


[x^TTZiyiixY^i^ 




Wi 


which further implies: 


ix^pT ET=iyi{x^)Aip ^ (x^r EI^i m(x'^)At, 

— - -> n • — - - 




(50) 


as w„ < w^ia^. Since x,- decreases, 


p 


^ £ 

Using similar arguments, as AipX^ > and recalling that yi{x^) > —YYiLi ^ijyi{x^) > 


>1 + 7, and therefore Xp decreases as well. 


■ x^jET=iAjyi{x^y. 


_ 1 — - 1 1 — /I 

xl'^yi{x^)Aip > x°Aipyj(x°) > 2 


„o 


i=i 


n mAoia,^ 6j 


'^Aijyiia 


> nx^j'^Aijyiix^), 


(51) 


i=i 
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as (5,- < , -and 2^/"(l - |)(1 - /3) > 2 9 =(1 - |)(1 _ |i) > 1 (since e E (0,1/6]). 


^ — 2^ /°‘n? rtiA- _ 

From (l51l) . it follows that 

m 

„0 


Xr 


^yi{x^)Aip > 


'=1 [k&S-■.Xk<-^^A^ki^0} '=1 

which further implies the first part of the lemma. 

For the second part, consider the following two cases: 

Case 1: Wp{x^Y~°' > Then: 

' {4rYT=iyi{^^)Ai, 


Xp yi{x^)Aip - (1 + -i)wp{xlY “ = Wpixlf- 


1=1 


Wr. 


- (1 + 7 ) 




> Wjixj) 


0\l-a 


i 4 rET=iyii^lAip 


Wr. 


- (1 + 7 ) 


> nwj{x^) 


n 




Wj 


00 -a / {x)YYT=iyi{x^)^i3 


- (1 + 7 ) 


(from dlOll) 


- (1 + i)wjix^Y~ 


1=1 


implying the second part of the lemma. 

Case 2: Wp{Xp)^~'^ < Wj{xjy~°'. Then: 


- (1+ 7)w^p(a^p)^ “ > - (1+ 7)^i^j(a:°)^ " 

1 = 1 
m 

- - (1 + l)wj{x°Y~°‘ (from dSB) 


i=i 


i=i 


> 




1=1 


thus implying the second part of the lemma and completing the proof. □ 

The following lemma lower-bounds the increase in the potential, in each round. 

Lemma 4.22. Let x^ and x^ denote the values of x before and after any fixed round, respectively, and let 
= {j : Xj > Xj}, S = {j : Xj < x^}. The potential increase in the round is lower bounded as: 

1. T>(xi) - ^>(x°) > n{py) x° YlZi yiix°)Aij; 

2 . hA) - -i>(i-») > n (jEIf) (e7i Ell - (1 + t) E”.i "-li'?)'-); 

$(ii) - $(i») > Si (j^) ((1 - 7) El, w,(x»)i-“ - E"., x» Ell K(^“)y 

Proof 

Proof of 1. From Lemma |4^ 


x\ — X^j\ 


Let ^j(x^) = 


<h(x^) — <h(x°) > Y^'^i — 
1=1 

lx _ (3:])° YfLiyi{A)Aij 


ix]Y 


^_iYrEZi]MxlY!l 


Wi 


. From the proof of Lemma WM if xj — > 0, then 1 — i^j(x^) > 
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\l > as 0 < < 1 — |7- If ~ < 0> Ih^ri 1 — ^j(x^) < —^, which implies 

1 < and thus 1 - Cj{x^) < + 7/2)"^ - 1) = ^j{x^)ji^ < = 


— ^^j{x^). Therefore: |1 — <( 7 > 

further gives: 

n- Ul _ . 

I d,j I 


1 - 




— 3 ui,' • 


which 


$(.7 - $(.“) > > 217 _,0|, fV,. 


j = l 


Z=1 


*j — 


If j G 5 +, thenx] = (l+/ 3 )x^, and therefore = (l - t^) a:] X]™ 12/i(a;^)A 

(1-i) 

Similarly, if j G S~ and then xj = (1 — / 3 )xj, and therefore |x] — x°| • yi{x^)^ij = 

- l) x] YJiLi yi{x^)Aij > (l - I) T^a;° Ya=i yi{x^)^ij- Using part 1 of Lemma| 42 ll 


o rn 

^>(x^) - $(x°) > '^yTB ^ X° ^yi(x°)^y. 

^ ie{S+us-} *=i 

Proof of 2 : Consider j G S“ such that x° > Then xj = (1 — /3)xj, > (1 + 7), 

and using Lemma 

{x]rTZM^^)Ary 


T>(x^) — <h(x‘^) > ^ 


W. 


I /y» 1 _ I 

\^3 


U6S-:xO>^} 


> 


E 


tc 


{x]y 


3 


1 - 


Wn 


„0^1-o 


\x]rET=iyii^^)A^ 


V 


'lo '• J' 


> 


U6S-:xO>^} 

/3 


( 1 -/ 3 ) 

> (1 - 7/4) 

= (1 - 7/4) 


( 1 -/ 3 ) 

y; (1-7/4) 


- 1 


tCi 




/3 


tCi 


(1-/3)^ 


/3 


^ Wjix'j) 


,0\l-a 


ix^JrET=ly^ix^)A, 


lies 


tCi 


- 1 


- (1 + 7 ) 


( 1 -/ 3 )“ 

{ieS-:xQ>3-^} 


X]yi(®°) - (1 + 


i=l 


Using the second part of Lemma 14.21 1 and the fact that for /c ^ S' : < (1 + 7 )> g^I 

the desired result: 


<I>(xi)-$(xO)>i(l-7/4)-^^-^ 


n m 


- (l + 7)X]^f(^i) 

yj = l i=l j = l 


0\l-a 


Proof of 3 : The proof is equivalent to the proof of Lemma l 4 . 101 part 2 , and is omitted for brevity. □ 

Consider the following definition of a stationary round: 

Definition 4 . 23 . (Stationary round.) A round is stationary, if both: 

1- Eje{S+u5-} a^i YT=i yi{x)Aij < 7 J2]=i Wj{x^j)^~'^, and 
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2. (1 - 27) e;=i < E ■=! E™ 1 

hold, where = {j : xj > x^}, S~ = {j : xj < x^}. Otherwise, the round is non-stationary. 

The following two technical propositions are used in Lemma 14.261 for bounding the duality gap in sta¬ 
tionary rounds. 

Proposition 4.24. After the initial the initial tq+ti rounds, where tq = ^ ln(l/(fniin). ^1 = ^ In(n^max). h 
is always true that Ga{x,y{x)) < Ej=i where _ 

Proof. Recall from dUl that the duality gap for x, y in {Pa) is given as: 


^1 —ol 


Ga{x,y) = '^WjY 


i=i 


- a I \xfYT=iyi^i. 


) i=\ J=1 


Ej=i 


Wa 


From Lemma l4~6l after at most initial tq + ri rounds: 

m n m 

^ ?/i < (1 + 3e) ^ Xj ^ yiA, 

j=l i=l 

l-a ( Ya= 1 Vi^ij 




2=1 


(1 + 3e) y~] 


WiX 




j=t 


Wi 


and letting P = ^ ggj. 


A—a. 


Ga{x,y) < - 1 + (1 + 3e)(l - a)^j - (1 - a)^j “ 

f=i 

X. _ 

= + (1 + 3e)(l - a)^j - 1 


i = l 


1—a 

t* . 

A 

— a 

„ 1 —a 


f 1 + (1 + 3e)(a - l)^j - ai] 

j = l V 


□ 


Proposition 4.25. Let ra(^j) = ( 1 + (1 + 3e)(Q; — l)^j — aP “ ), where . If a > 1 


and G (1 — 7 ,1 + 7 ) Vj G {1,..., n}, then ra{Cj) < e(3a — 2 ). 
Proof Observe the first and the second derivative of Tq, (^j): 


draiP) 




d^j 

d^ra{ij) ^ 1 

df^j^ a 


= L{a-l)^-G»-\ 


As > 0, r(^j) is convex for a > 1, and therefore: r{£^j) < max{r(l — 7 ), r(l + 7 )}. We have that: 

£ 


r(l — 7 ) = r(l — e/4) = 1 — ^1 — 


- a)(l + 3e) + a(l - e/4)"^/") 

1 - (1 - 0 (1 - a + 3e(l - a) + a(l + e/4)^/“) 

1 — ^1 — 0 (1 + e/4 + 3e(l — a)) (from (1 + e/4)^/" > 1 + e/(4a)) 


< 1 - 11 - 

= 1 — 1 — ^ + 3e(a — 1) + ^(1 + e/4 — 3e(a — 1)) 
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On the other hand: 


< e{3a - 2 ). 


r(l + 7) = r(l + e/ 4 ) = 1 - + 0 ((1 - a)(H- 3 e) + a(l + e/ 4 ) ^/“) 

< 1 - (^1 + 0 (1 - a + 3 e - 3 ea + a(l - e/ 4 )^/“) 


< 1-1 + 


1 “h ~~rS — ‘^EOt 

4 

< 1 — ^1 + ^e — 3eo; 

< e(3Q! — 2 ), 

completing the proof. □ 

The following lemma states that in any stationary round current solution is an (l+e(4Q;—l))-approximate 
solution. 

Lemma 4.26. In any stationary round that happens after the initial the initial tq + ti rounds, where tq = 
^ ln(l/(5niin). Ti = ^ In(n^max). have that pa{x*) — Pa{x) < e(4a — 1)(—Pq,(x)), where x* is the 
optimal solution to {Pa) (ind x is the solution at the beginning of the round. 

Proof Observe that for any k ^ {5''' U S~} (by the definition of S~^ and S~) we have that 1 - 7 < 

Wk 


<1 + 7, which is equivalent to: 

m 

(1 - y)wkx\~‘^ < Xfc ^ yi{x)Aik < (1 + y)wkx\~‘^ VA: ^ {S’+ U S~}. 
Using stationarity and 


( 52 ) 


2=1 


(1 - 27) Y1 - X] X] A 


i=i 


j=i i=i 


^ xi'^yi{x)Au+ ^ Xk'^yi{x)Aik 

l£{S+US-} i=l k^{S+US-} i=l 


Wkxl ". 


<7^u;jx] " + (1 + 7) 

i=i k^{s+us-} 

Since Eie{S+uS-} = E]=i ^jX^ “ Efc^{s+us-} using (l 53 ]l: 


(1 - 27 ) Y “ < 2 ' X] “ + (1 + 7 ) Y 


Wkxl " 


«e{5+uS-} 


i=i 

n 


fe^{5+U5-} 


(1 - 27 ) 

fe^{S+US-} 


( 53 ) 


Wkxl “ 


“ + 37 Y 

j=l fc^{S+US-} 


Wkxl " 


— A Y1 

i=i 


1—a 


and therefore: 


A n n 

Y. 

IG{S+US-} ^ i=i j=i 


( 54 ) 
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as 7 = I and e < 

Aspa{x*) —poi{x) < G{x,y{x)), from Proposition |424l 

„1 —CK 


Pa 


ix*)-Paix) <Y^ 


X^ 


Wn 


i=i 


a — 1 


1 + (1 + 3£:)(q; — ~ 


E 


Wk 


k^{S+US-} 

+ E 

/e{S+uS-} 


a 


a — 1 


Wl 


, 1 —a 


a — 1 


l + (l + 3e)(a-l)efc-a4“ 


l + (l + 3e)(a-l)6-a6 “ • 


From Proposition 14.251 


E 


Wk 


k^{S+US-} 


a 


a — 1 


1 + (1 + 3e)(a - 1)4 - a4“ ) < e(3a - 2) ^ 


Wk 

k^{S+\JS-} 

„1 —Q 


^ 1 —a 


a — 1 


<e{3a-2)Y, 


Xa 


Wi 


a — \ 


(55) 


i=i 

= e(3a - 2)(-pa(2;)). 

X Q: —1 \ 

Observe 'Ei&{s+us-} ( 1 + (1 + 3e)(Q! - 1)^/ - “ ] ■ Since a > 1, each wi-^ > 0, and 


therefore: 


A—a 


^ il + {1 + 3e){a - 1)^1 - a^i 

ie{s+us-} ^ 


< 


E 


Wl 


ze{s+us-} 


E 


Wl 


ze{s+us-} 


a 


a — 1 


^ 1 —a 


a — 1 


((1 + 3e)(a — 1)^/ + 1) 


(l+3.)(a-l)fLSEEEE + i) 

Wl ) 


(l + 3e) ^ xi'^yi{x)Aii+ ^ wr 


A—a 


a — 1 

iG{s+us-} i=i ie{s+us-} 

Now, from stationarity Eze{S+us-} YAi yi{x)Ai < 7 Ej=i WjX^^^ and using ([Slll we get: 


E 


Wl 


a 


a — 1 


1 + (1 + 3e)(a — 1)4 — Aj “ j < y~] w 


X 


^l-a 
3 

A — i 
3e 


( 7(1 + 3e)(Q; — 1) + 57 ) 


ze{S+u5-} ^ ^ i=i 

/.tp \ 

ej . (56) 

Finally, combining (1551) and (l56l) : pa{x*) — Pa{x) < e(4a — 1 )(—Pq,(x)). □ 

The following two lemmas are used for lower-bounding the potential increase in non-stationary rounds. 
Lemma 4.27. Consider any non-stationary round that happens after the initial tq + ri rounds, where 
To = ^ ln(l/(5min), T"! = ^ lii(nAmax)- Let x^ and x^ denote the values of x before and after the round 

update. If^YliVi^^) > > then ^{x^) - (l>{x^) > 0 ( 7 ^)(- 4 >(x°)). 
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Proof. Observe that as ^ y{X^) > ~ Z] 


W) 


oa-o 




-$(xo) < 2 • - '^ViX^) < 


,o^ ^ 2(1 - 36) 


'^x^j^yi{x^)Aij, 
j=l i=l 


where the last inequality follows from Lemma l46l 

Since the round is not stationary, we have that either: 


1 - T.jGS-us+x'jT.iyiix)Aij > 


or 


2. (1 - 27)e;=i > EU 1 

Case 1: Ej&s-us+ E* Vi{x)Aij > 7 E'"j=i Wj{x^f-°^. If: 


n m 


then 


^ ^ wll”) < (1 + 27 ) 

i=i *=i i=i 

m m 

2/iX] = ^( 7 ) Z] y* 

j&s-yjs+ i i=i *=i i=i *=i 

and, from the first part of Lemma l4.22[ the potential increase is lower bounded as: 


$(xi) - $(x°) > 0(/372) yi{x^)A 

j=l i=l 

= ni^Kj^)i-^x^)) 

= Q{E){-Hx^)). 




On the other hand, if: 


n m 


j=l i=l j=l 

then, from the second part of Lemma 14.221 

m 

4>(x^) - ^(x^) > 0(/37) E E 

j=l i=l 

= O(/37k)(-$(x0)) 

= ^(7^)(-^(a;°))- 

Case 2: (1 — 27 ) E]=i '^j(xj)^~°' > J2J=i Xj EI^i yi(x^)^ij- Then, using the third part of Lemma 14. 22 1 

/3 


4>(x^) - 4'(x°) > n E E 


j=l i=l 
m 

= ^ {h) yiix^)Aj 

j=l i=l 

= 0(/37«:)(—<h(x°)) 

= 0(7')(-<h(x°)), 

where in the second line we have used that = ©(/?)■ This can be shown using the generalized 

Bernoulli’s inequality and ea < ^ as follows: 

1 (y. E ^ 

- -— > (1 - 2/3)“ > 1 - 2a/3 = 1 - -->1-= 0(1). 

(l + /3)““^ ' ~ k + a 100 
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□ 


Lemma 4.28. Consider any non-stationary round that happens after the initial tq + ri rounds, where 
To = ^ ln(l/5min), Ti = ^ lii(n 74 max)- Lgf and denote the values of x before and after the round 

update. If^ y{X^) < “ Ylj '^3 ^^i-a ’ > Q (/37^) (a — 1 )(—‘5(x°)). 

Proof Observe that as ^ y(X^) < — Wj^- 


-$(xo) < - 2 ^' 


'^3 1-c 




1—a 


\ — a a — \ 

3 3 

From the definition of a stationary round, we have either of the following two cases: 

Case 1: Z^jgjs+us-} Vi{x)Aij > 7 From the first part of Lemma l4~22l the in¬ 
crease in the potential is: <h(x^) — > 0 (/^T^) '^ 3 (^j)^~‘^’ 

the increase in the potential is at least: 

$(x^) - «>(x°) > n{p-f^){a - l)(-«>(x°)). 

Case 2: (1 — 27 ) 'Wjx]~'^ > Yl'j=i ^3 Y^lLi Using part 3 of Lemma l432l the increase in 


the potential is then ‘h(x^) — > 0 

as in the proof of Lemma 14.27 1 


/3 


( 1 +/ 9 )' 


Yyj=i'^jx] "■ Therefore, using that = Q(/3) 


$(x^) - 4>(x°) > 0(/37)(a - l)(-4>(x°)). 


□ 


Proof of Theorem 14.31 We will bound the total number of non-stationary rounds that happen after the initial 
"To + "^1 rounds, where tq = ^ ln(l/(5 m:Ti ), xi = ^ In(n^max)- The total convergence time is then at most the 
sum of To + Ti rounds and the number of non-stationary rounds that happen after the initial tq ri rounds, 
since, from Lemma 14.261 in any stationary round: p{x*) — p{x) < e(4a — l){—p{x)). 

Consider the non-stationary rounds that happen after the initial tq -|- ti rounds. As Xj E [6j, 1], Vj, it is 
simple to show that: 


W 

a — 1 


< 


E 


1 — 0 


Wi 


< 


W 


a — 1 a — \ 


2R 


a — 1 

■w “ n 


2(o-l)™a-l/l 2 q-1 

Hi -^max ? 


(57) 


and 

^ ^ mC* 

0 < — yi(x) < -< emC. (58) 

K K 

I 

,\ — OL .. 

Recall that <I>(x) = — ffj -^ yi{x) and that the potential 4>(a:) never decreases. 

There can be two cases of non-stationary rounds: those in which Yj '^j a_i dominates in the absolute 
value of the potential, and those in which T Yi yi{x) dominates in the absolute value of the potential. We 
bound the total number of the non-stationary rounds in such cases as follows. 

1 — a 

Case 1: ^ Ylj • From (l57]) and (l5^ . in any such round, the negative potential is 

bounded as: 

< O (emC). 

Moreover, from Lemma 14.271 in each Case 1 non-stationary round, the potential increases by at least 
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n( 7 ^)(—‘I’(x)). It immediately follows that there can be at most: 




emC 

w 

, a-l , 


= O ( ^ In ((a - l)ei2^nm^max) 


— O ( o In (^R^nfnAjj^^x) 


( 59 ) 


Case 1 non-stationary rounds, as (a — l)e < ae < 

1 — a 

I _ 

a—1 


10 * 


Case 2: < Ylj (I57] ) and (l58l) . in any such round, the negative potential is 

bounded as: 




W 

a — 1 


< —<h(x) < O 


W 

a — 1 




Moreover, from Lemma l4.211 in each such non-stationary round the potential increases by at least Q (/3'y^) (a— 
1)(—<h(x°)). Therefore, there can be at most: 


O 


/3j^(a - 1 ) 


In 


w 

a-l 


= o 


Pr 

1 


■ ln(i?^ a nmA„ 


= 0 { ^ ln(i?^nmAmax) In R 


nmAr, 


e 

(60) 


Case 2 non-stationary rounds. 

The total number of initial tq -|- ti rounds can be bounded as: 

To + Ti = ^ln(l/(5 

min ) -h ^ \n{nA 

max) 

= O In (i?^nmAmax) In ^ 

Combining (l59l) . (l60l) . and (IhTl) . the total convergence time is at most: 


nmAr, 


(61) 


nmAr 


O In {Rr, ■ nmAraax) In yR^ 

Finally, running o-FairPSolver for the approximation parameter e' = ej (4a — 1), we get that in any 
stationary round Paix*) — Pa{x) < —£pa{x), while the total number of non-stationary rounds is at most: 

O In {Ruj • nm^rnax) In 

□ 


4.4 Structural Properties of a— Fair Allocations 

Lower Bound on the Minimum Allocated Value. Recall (from Section |2l) that the optimal solution x* 
to {Pa) must lie in the positive orthant. We show in Lemma 14.291 that not only does x* lie in the positive 
orthant, but the minimum element of x* can be bounded below as a function of the problem parameters. 
This lemma motivates the choice of parameters 6j in a-FAiRPSOLVER (Section |3]l. 

Lemma 4.29. Let x* = (x|,..., x* ) be the optimal solution to {Pa)- Then Vj G {1, ..., n}: 


• ^ if0<a< 1, 

• X* > if a > 1, 

where ni = i^ number of non-zero elements in the row of the constraint matrix A, 


^With the abuse of notation, l{e} is the indicator function of the expression e, i.e., 1 if e holds, and 0 otherwise. 
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and M = min{m, n}. 
Proof. Fix a. Let: 






A {l-a)/a 
-^ma-y 


i-‘ ^11 

ijoL 


.M 


l/a 

1 


1 


if a < 1 
if a > 1 


For the purpose of contradiction, suppose that x* = (x^, is the optimal solution to {Pa), and 

X* < Hj{a) for some fixed j G n}. 

To establish the desired result, we will need to introduce additional notation. We first break the set of 
(the indices of) constraints of the form Ax < 1 in which variable Xj appears with a non-zero coefficient into 
two sets, U and T: 


• Let U denote the set of the constraints from {Pa) that are not tight at the given optimal solution x*, 
and are such that Au^j / 0 for u G (7. Let = 1 — ^uk^k denote the slack of the constraint 

ueU. 


Let T denote the set of tight constraints from {Pa) that are such that Atj / 0 for t G T. Observe that 
since x* is assumed to be optimal, T / 0 . 


Let e, = min 


{pj(a) 


— mm, 


Su/Auj'^. Notice that by increasing Xj to x* 


+ Ej none of the 


constraints from U can be violated (although all the constraints in T will; we deal with these violations in 
what follows). 


In each constraint t £ T, there must exist at least one variable Xk such that xl > 


1 


ntAtk 


, because 


:. Select one such Xk 


Ya=i Atlx] = 1, as each f G T is tight, and x* < Hj{a) < mini:^,^.^o 
in each constraint t £ T, and denote by K the set of indices of selected variables. Observe that \K\ < \T\ 
(< M), since an Xk can appear in more than one constraint. 

For each k £ K, let denote the constraints in which Xk is selected, and let 

Ek = max (62) 

If we increase Xj by Ej and decrease by VA: G K, each of the constraints t £ T will be satisfied since, 
from (l62l ) and from the fact that only one Xk gets selected per constraint t £ T, ejAtj — YhkEK ^k^tk < 0. 
Therefore, to construct an alternative feasible solution x', we set x'- = x* + Ej, x'^ = x*^ — Ek for k £ K, 
and x) = X* for all the remaining coordinates I £ {1,..., n}\(iT U {j}). 

Since j is the only coordinate over which x gets increased in x', all the constraints Ax' < 1 are satisfied. 
For x' to be feasible, we must have in addition that x'^ > 0 for A: G iT. We show that x'^ = x|, — > 0 as 

follows: 


Ek = Ej ■ max —— 

teTk'.Atkj^o Atk 


< Hj{a) 


Atj 

max — 
t&Tk'.AtkJ^O Afk 


< 


A 


mm —— 

i'.Aij^o riiAi 


max 

tETk-.Atk^Q Atk 


< 


1 A 


max 


tj 


< 


teTk-.Atk^o ntAtj Atk 

1 


max 


t&Tk-.Atk¥=o ntAtk 

< xl, 
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where the second line follows from sj < fij{a) — x* < ^jLj{a), and the last line follows from the choice of 


Xk- 


The last part of the proof is to show that > 0, which contradicts the initial assumption 

that X* is optimal, by the definition of a-faimess from Section|2] We have that: 


^wi 

1=1 


Xl-Xi 


= w. 


iz.j G/j 

rp*a / J rp*Q 


k&K 


E 

keK 

E 

k&K 


W 


^x*°‘\K\ 


- Wk- 




x^j^jxT ~ WkekX*°'\K\ 
x*^xl^\K\ 


(63) 


f G T such that = 


Atk 


Consider one term from the summation (I63]) . From the choice of e^-’s, we know that for each there exist 


and at the same time (by the choice of Xk) we have x\ > 


1 


^tAtk 


so that 


WjEjXj^ > Wj 


^kAtk ^ A ^ '^k^j^k Aik f 1 


A 


tj 


AtkXlt) 


> 


(64) 


'^max Aij yAikTlt^ 

Case 1. Suppose first that a < 1. Then / 0 ^ Atk > 1. Plugging into 

I, we have: 

WkWjEk 1 


* a ^ 
WjEjXk > 


Wr, 


'kit At j 


By the initial assumption, x*, < iXj{a) = 


M mAii 


1/a 


and therefore 


*ai 7^1 ^ yJk'Wjek \K 
WkEkXj \K \ < - - -mm 


Wr, 


M iiAij^o riiAi 


1 ^ WkWjEk _1 


Wr, 


wtAtj 


(65) 


( 66 ) 


since it must be |iT| < M (= min{m, n}). From (1651) and (l 66 l) . we get that every term in the summation 
(| 6 ^ is strictly positive, which implies: 


Y^wi 

1=1 


Xl Xi 


X 


> 0 , 


and therefore x* is not optimal. 

Case 2. Now suppose that a > 1. Then 


< IJj{a) = ^max^^ min < ^max^^ 

\WmaxMj r.Aij^OmAi- 


Therefore: 


U) ■ 

WkEkX*A\K\ < WkEk - ^-I^A II 

■’ rUmaxM 




1—a 


1 


< Wk 


= Wk 


< Wk 


'X’i . 

— ^—A 


1—a 

max 


A , 1^1 

niAij 

1 \"Atk^ 


Wrriax \Atknt J Aij 

Wj EkAtk {AtkjAijY-^ ( 1 


'U^max Aij 


A ““1 

^ma,x 


W 


J 


WmnxM 


l/a 




Atknt 


Wj ^k-^tk ( 1 


'^max y^tk'^t 


( 67 ) 


as |iT| < M, and ^ "^max (since for any i, j: 1 < Aij < ^max)- 

*3 


-il 



































Finally, from (l64l) and (l67l) we get that every term in the summation (l 6 ^ is positive, which yields a 
contradiction. □ 

Asymptotics of a— Fair Allocations The following lemma states that for sufficiently small (but not 
too small) a, the values of the linear and the a— fair objectives at their respective optimal solutions are 
approximately the same. This statement will then lead to a conclusion that to e—approximately solve an 
a— fair packing problem for a very small a, one can always use an e— approximation packing LP algorithm. 
Lemma 4.30. Let {Pa) be an a— fair packing problem with optimal solution x*, and (Pq) be the LP with 
the same constraints and the same weights w as (Pa) and an optimal solution z*. Then if a < /gp 

we have that 'ff- wjZj > (1 — 3e) Ylj ^ , where e G (0,1/6]. 

Proof The proof outline is as follows. First, we show that the a—fair objective Pa{x*) can be upper- 

bounded by a linear objective as Pa{x*) = Ylj '‘^j {-a — (^ + 0(e)) WjX*. Then, to complete the 

proof, we use the optimality of z* for the LP: WjZ* > WjX* (> (1 - 0(e)) from the 

first part of the proof). 

Let g(xj) = -(1 + e)xj. Consider the case when g(xj) < 0. Solving g(xj) < 0 for Xj, we get 

that it should be 

1 \ i/« / 1 \ i/« 

' ' . ( 68 ) 


Xj > 


1 — a 


1+6 


Choose a so that 


Thentohave^(xj) < 0, 


— () > which is equivalent to a < 
it suffices to have Xj > —, because (i) G [e,4] for a G [0,1/2], where e is the base of the 

natural logarithm, and (ii) , ^.i/^ < () by the choice of a. 


Now, as a < 


ln(l+£) 

ln(4nkmax/e) 


, summing over j such that x* > 


nAn 


E 

o-r*>_ - 

^■^3-nAn 


\ 1—a 


W 




WjXj = 


j:x *>—- j:x *>— 

-J •? — nAmax '' 3 — nA] 


we have: 

^ Wjg(x*) < 0 (69) 

Observe that since 


‘imax 3 — 'H.^imax 3 — n^max 

Now we bound the rest of the terms in pa(x*), i.e., we consider j : x*, < —f —. 

^ forj = {l,..., n} is a feasible solution to (Pa) and x* is the optimal solution to (Pa), we have 

-< Ei Wj , which gives: 


^3 = TJ- 

that^.mEEA 


'^3 


E 


(^•) 


* 'i 1—a 




1 — a 


< e 


1 — Q. 


E 


(l/nA„ 


\ 1—a 


< 6 


1—a 


r-^j<TtA: 


1 — a 


E 


w 


max 

* \ 1 —a 

P 


3 


1 — a 


< 


2^E 


i=i 

n 

Wi 


i=i 


1 — a 


Therefore: 


E 

J — nAn 


Wi 


(Xi) 


* ^ 1—a 

j 


1 — a 


>(l-2e)5^ 


Wi 


1=1 


1 — a 


(70) 
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Combining (1^ and (TTOI) . we now get: 


n 


E 




(*•) 


,* \1 —Q 


1 — a 


< 


1 + 6 
1 - 2e 


3 — ^-Amax 


(71) 


Finally, since z* optimally solves (Pq) (which has the same constraints and weights as {Pa)), we have that 
X* is feasible for (Po)^ and using (ItTI) and optimality of z*, it follows that: 


i=i i=i 

1 - 2e " 


> 




l + e I-a 

3 = 3 


(x*) 

> (1 - ^ 


,* \ 1 —a 


i=i 


— a 


as claimed. □ 

Observing that for any a € (0,1), > z* (since, due to the scaling, z* € [0,1]), a simple 

corollary of Lemma 14.301 is that an e—approximation z to (Pg) (Ylj Ylj "^jZ*) is also an 

0(e)—approximation to (Pa), for a < /g) - Thus, to find an e—approximate solution for a < 

\n{nA^ je) ’ packing LP algorithm of fTl can be run, which means that there is a stateless distributed 
algorithm that converges in po\y{hi{e~^RyjVfinA-^^y^)/e) time for a arbitrarily close to zero. 

The following two lemmas show that when a is sufficiently close to 1, {Pa) can be e—approximated by 
e—approximately solving (Pi) with the same constraints and weights. 

Lemma 4.31. Let x be an e—approximate solution to a 1-fair packing problem (Pi) returned by a- 
FairPSOLVER. Then, for any a G [1 — I/tq, 1), where Tq = ^ ^ ^ 2e—approximate 

solution to {Pa), where the only difference between (Pi) and {Pa) P in the value of a in the objective. 


Proof Suppose that x is a solution in some stationary round, provided by a-FAiRPSOLVER run for a = 1. 
Fix that round. It is clear that if x is feasible in (Pi), it is also feasible in {Pa), since all the constraints in 
(Pi) and {Pa) are the same by the initial assumption. All that is required for a dual solution y to be feasible 
is that Pi > 0, for all i, and therefore y{x) is a feasible dual solution for {Pa)- The rest of the proof follows 
by bounding the duality gap Ga{x, y{x)). Recall from Q that: 


Ga{x,y{x)) = '^Wj 

3=t 


—Q! 


1 — a 


Xi 


l^i=l yi^lj \ a 


Wi 


- 1 


+ ^yi 

i=l 


E 

3 = 3 


WjXj 


1—a 




(72) 


Since x is a solution from a stationary round, from the second part of the definition of a stationary round 
(Definition |447]), we have that: 

n n n 

yi{x)Aij < (1 + 27 ) ^ Wk- 

j=l i=l k=l 

Further, from Lemma l4~6l 

m n n n 

'^yi{x) < (1 + 3e)'^Xj'^yi{x)Aij < (1 + 3e)(l + 2y)'^Wk. (73) 

2 = 1 j = l 2=1 k = l 

Next, we show that: 

>1-7, Vj. (74) 


39 


















Rearranging the terms and taking logarithms of both sides in (1741) . we obtain the equivalent inequality 
1 — a < Recall from a-FAiRPSOLVER that in every (except for, maybe, the first) round 

Xj > Sj > <5min- As ln(l/(l — 7 )) > 7 , it therefore suffices fo show fhaf 1 — a < . ) ■ Buf from fhe 

sfafemenf of fhe lemma, 1 — q:< 1 /to< . y complefing fhe proof of (T/dl) . 

Combining (17^ and (1741) . we gel fhaf: 




(1 + 5e) 


WiX 


1 — Q. 


3^3 ’ 


(75) 


i=l j=l 

where fhe second inequality follows from s < 1/6, 7 = e/4. 
Using (1751) . we can bound fhe duality gap (Eq. (1721) 1 as: 

^ l-a / .^asr^rn „ A. 


j=i 


Ga{x,y{x)) < ^ 


Wi 


1=1 


1 — a 


a 


Wi 


— 1 + (1 — Q:)(l + 5e) 


(76) 


To complele fhe proof, recall from Lemma l4.19l fhaf in any round of fhe algorifhm, for all j: ^ > 

(1 — 7 )^°. As a < 1 and xj G [0,1], Vy, if holds fhaf Xj" > Xj, Mj, and Iherefore: 

3 ^ 1 “ Ta=i yi{x)Ai 


Wi 


Vj. 


Finally, recalling fhaf 1 — a < I/tq, and combining (177]) wifh (17^ . we gef: 

72 ^1 1 / \ 

Gaix,y{x)) < ^ Wj ^ - 1 + (1 - a)(l + 5e)J 


1—a 


^ -((1 + 27 )^/“ - 1 + (1 - a)(l + 5e)) 

1 — a 
j=i 

1—a 

‘'1 




1=1 


X 

1 — a 


(1 + e — 1 + (1 — a) (1 + 5e)) 


< 


E X' 

Wjj 


1=1 


1—a 


— a 


(77) 


where fhe fhird inequality follows from a>l — l/ro>l — ^>1 — |g, and fhe fourfh inequality follows 
from 1 — a < e /2 and e < 1 / 6 . □ 


Lemma 4.32. Let x be an e—approximate solution to a 1-fair packing problem {Pi) returned by a- 
FAIRPSOLVER. Then, for any a G (1,1 + l/rol, where tq = 4ln(x^), x is also a 2£—approximate 
solution to {Pa)’ where the only difference between (Pi) and {Pa) A in the value of a in the objective. 

Proof Similar fo fhe proof of Femma l4.31[ we will fix an x from some sfafionary round of a-FAiRPS OLVER 
run on (Pi), and argue fhaf fhe same x 2 e—approximafes {Pa) by bounding fhe duality gap Ga{x,y{x)), 
allhough we will need fo use a differenl sef of inequalities since now a > 1. Similar fo fhe proof of Lemma 
14.311 as x is (primal-)feasible for (Pi), x and y{x) are primal- and dual-feasible for (Pa). 

By fhe same token as in fhe proof of Lemma l4.31l 

m n 

^ yi{x) < (1 + 3e)(l + 27 ) ^ Wj. 

i=l j=l 

As a > 1 and xj G (0,1], Vj, we have fhaf xp““ > 1, Vj, and Iherefore: 

m n n 

^ 2 /i(a;) < (1+ 3e)(l+ 27 ) < (1 + 4e) ^ (78) 

i=l j=l j=l 
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Therefore, we can write for the duality gap: 

l-Q / , ^ a. 

«-^ / ' / r . X 

j = l 

\ — OL 


(.,!,(.)) s E -1+(1 - «)(i+4.)) 

Ya=\ 


(79) 


E X 


f=i 


— a 


a ... A ..^ Hn-l 

_ V ) “ 

Wn 


+ 1 + (q; — 1)(1 + 4e) ] . (80) 


Notice that, as a > 1, the objective for (Pa), ^2]=! negative. 

Using the same arguments as in the proof of Lemma 14.311 it is straightforward to show that Xj°‘~^ > 
1 — 7 , Vj. From Lemma 14.191 we have that ^4 VziPA] > _ 7 )^ 0 , Vj, and therefore: 


YT=i yiix)Aj _ Xj^ “ • Xj yT=i yi{x)A 


Wj Wj 

Recalling that a — 1 < I/tq (by the statement of the lemma) and using (IMl) . we have: 

(X V^?n A a — l 

Z^i=iyi^ij\ « ^ _ ^^(ro + l)/(ro(l+l/ro)) 


(81) 


Wi 


= (1-7)- 


(82) 


Finally, plugging (l82l) into (l80l ). we have: 

n 

Ga{x,y{x)) 


1—a 


J=1 

n 

E' 

i=i 


1 — a 


(-a(l - 7 ) + 1 + (a - 1)(1 + 4e)) 


— - { a ■ -e + 4e(a — 1) 

1 — a V 4 


< 

where the equality follows from 7 


—e 


E 

i=i 


1—a 


1 — a 


= |, and the last inequality follows from a — 1 < — < 


£ 

20 * 


(83) 

□ 


Finally, we consider the asymptotics of a— fair allocations, as a becomes large. This result complements 
the result from ll3^ that states that a— fair allocations approach the max-min fair one as a 00 by showing 
how fast the max-min fair allocation is reached as a function of a, Rw,n, and First, for completeness, 

we provide the definition of max-min fairness. 

Definition 4.33. (Max-min fairness / liOl/ . ) Let TZ C MIJ: be a compact and convex set. A vector x G TZ is 
max-min fair on TZ if for any vector z GTZ it holds that: if for some j G {1,..., n} Zj > Xj, then there exists 
k G {1,..., n} such that Zk < Xk and Xk < Xj. 

On a compact and convex set TZ C the max-min fair vector is unique (see, e.g., iilSSl). The 
following lemma shows that for a > ln(i?„,nAmax)> the a— fair vector and the max-min fair vector 
are e—close to each other. Notice that because of a very large gradient of Pa{x) as a becomes large, the 
max-min fair solution provides only an 0(ea)— approximation to (Pa)- 

Lemma 4.34. Let x* be the optimal solution to (Pa) = max{pa(3:) : Ax < l,x > 1)}, z* be the max- 
min fair solution for the convex and compact set determined by the constraints from (Pa)- Then if a > 
e~^ In (RyjuA^ax), we have that: 


1. Pa(x*) < (1 — e(a — l))pa(z*), i-e., z* is an e(a — 1)—approximate solution to (Pa), and 


2. (1 - e)z* < x* < (1 -h e)z*,forall j G {1, ...,n}. 
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Proof. Suppose that, starting with z*, we want to construct a solution 2 ; that is feasible in {Pa) and is such 
that Pa{z) > Pa{z*). Then we need to increase at least one coordinate j of z*. Suppose that we increase a 
coordinate j by a factor 1 + e, so that Zj = {l + s)zj. Since 2 * is the max-min fair vector, to keep 2 ; feasible, 
the increase over the coordinate must be at the expense of decreasing some other coordinates k that 
satisfy 4 < We will assume that whenever we decrease the coordinates to keep the solution feasible, 
we keep the solution Pareto optimal (i.e., we decrease the selected coordinates by a minimum amount). 
Using Fact l4.81 we have: 


Pa{z) -Pa{z*) < 


n 


1=1 


Zi - z*i 

(44 


< Wj 



Wj{z*) 


* ^ 1—Q 

j ' 


(84) 


Now, suppose that we want to further increase the j**’ coordinate by some small 6. Call that new solution 
z^. Then, the total amount by which other coordinates must decrease to keep the solution feasible is at 
least , since the feasible region is determined by packing constraints and it must be Az < 1, where 
1 < Aij < Vi,y. Moreover, since 2 ;* is max-min fair, each coordinate k that gets decreased must 

satisfy 2 ;^ < 2 ;*. It follows that: 

Piz4-p{z) < 


6 

on ■ _ 

^(1 + e )“( 2 p “ 


+ 


E 

k:zl<zi, 


Wk 


4-^k 

i^k)^ 


< W 


'^max 


(l + e)“(2:p 
(5(mmax - {l+e)°‘w 


- W„ 


4K 


(4)^ 




(l + e)“(2*)^ 


< 0 . 


(85) 


The last inequality can be verified by solving the inequality rumax ~ (1 + e)“w^min/^max < 0 for a, and 
verifying that it is implied by the initial assumption that a > e~^ ln(R,„nA rr,» A. 

Therefore, the maximum amount by which any coordinate of 2 ;* can be increased to improve the value 
of the objective Pai-) is by a multiplicative factor of at most (1 + e). Since we can construct x*, the 
optimal solution to (Pa), starting with 2 ;* and by choosing a set of coordinates j that we want to increase 
and by only decreasing coordinates k such that 4 < 4 whenever coordinate j is increased, it follows that 
X* < (1 + e)z*, Vj. 

Moreover, from (IMl) and (1^ : 

Pa{z^) - Pa{z*) =p{z4-piz) + piz) - p{z*) < £ • (z*)^"", 

and we can conclude that: 

n 

Pa{x*) -Pa{z*) < • Wj{z*)^~'^ = e(l - a) -Paiz*), 

i=i 

which means that z* is an e{a — 1)—approximate solution to (Pa)- 

Now consider the coordinates we need to decrease when we construct a solution 2 ; from 2 ;*, such that 
Pa{z) > Pa{z*). Suppose that to increase some other coordinates, a coordinate k is decreased by a factor 
(1 — e): Zk = (1 — e) 2 ^. As 2 ;* is max-min fair, only coordinates larger than 2 ;^ can increase at the expense of 
decreasing 2 ^. Suppose now that we decrease the A:'*’ coordinate further by some small 6 . Call that solution 
z^. Then the maximum number of other coordinates j that can further increase is min{n — l,m} < n. 
Moreover, each coordinate j that gets increased satisfies > 4 ^ and can be increased by at most 
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Using Fact l4.81 it follows that: 


Pa{z^) -Pa{z) < 


< 


< 

where the last inequality follows from (1 — e)“ < {RwnAma.x)~^, which is implied by the initial assumption 
that a > e“^ln(i?^nAmax)- 

Therefore, using (IMl) . the coordinate can decrease by at most a multiplicative factor (1 — e). Using 
similar arguments as for increasing the coordinates, it follows that Xj > (1 — £)zj,yj. □ 

5 Conclusion 

We presented an efficient stateless distributed algorithm for the class of a-fair packing problems. To the best 
of our knowledge, this is the first algorithm with poly-logarithmic convergence time in the input size. Addi¬ 
tionally, we obtained results that characterize the fairness and asymptotic behavior of allocations in weighted 
a—fair packing problems that may be of independent interest. An interesting open problem is to determine 
the class of objective functions for which the presented techniques yield fast and stateless distributed al¬ 
gorithms, together with a unified convergence analysis. This problem is especially imporfanf in lighf of 
fhe facl fhat a-fair objectives are nof Lipschifz confinuous, do nof have a Lipschifz gradienf, and fheir dual 
gradienf’s Lipschifz consfanf scales af leasf linearly wifh n and Amax- Therefore, fhe properties fypically 
used in fasl firsf-order mefhods are lacking |l2l|40l. Finally, for applications of a-fair packing fhaf do nof 
require uncoordinated updates, if seems plausible fhaf fhe dependence on s~^ in fhe convergence bound can 
be improved from fo by relaxing fhe requiremenf for asynchronous updates, similarly as was done 
in lO over fJl. 
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"in(xj), if a = 1 
™l-a 

^ if a 7^ 1 


A Scaling Preserves Approximation 

Let the a-fair allocation problem be given in the form: 

n 

(Qa) niax| ^ Wjfaixj) : Ax < 6, x > o|, where /q 

7=1 I 1 ’ 

V 1 — q; 

w is an n—length vector of positive weights, x is the vector of variables, A is an n x m constraint matrix, 
and b is an m—length vector with positive entries. Denote Pa{x) = Wjfa{xj). 

It is not hard to see that the assumption hi = IMi is without loss of generality, since for / 1 we can 
always divide both sides of the inequality by hi and obtain 1 on the right-hand side, since for (non-trivial) 
packing problems bi ^ 0. Therefore, we can assume that the input problem has constraints of the form 
A - X <\, although it may not necessarily be the case that Aij > 1 / 0. 

The remaining transformation that is performed on the input problem is: 


'T* • - • 'Y' • 


where 

jmmij,A^^^^Aij, 

if Aij < 1 

F. 

otherwise 

The problem {Qa) after the scaling becomes: 

n 

n 

max ^ Wjfa{xj) • 

{Pa) max '^Wjfa{Xj 


i=i 

s.t. Ax < 1 

X > 0 




i=i 

s.t. Ax < 1 

X > 0, 


as c 


1 — OL 


is a positive constant. Recall that ck-FairPSolver returns an approximate solution to (F’a)^ and 


observe that x is feasible for [Qa) if and only if x is feasible for (Pa)- 

Choose the dual variables (Lagrange multipliers) for the original problem {Qa) as: 

y. = = c^-^C • 

and notice that 


(87) 


Xj" ^ ViAij = x“ • c “ • ^(c" ^ - Vi - c - Aij) ViAij. (88) 

i=l 1=1 2=1 

It is clear that y^’s are feasible dual solutions, since the only requirement for the duals is non-negativity. 

A.l Approximation for Proportional Fairness 

Recall (from (Illl) that the duality gap for a given primal- and dual-feasible x and y is given as: 

n n / m \ m 

G{x, y) = ^ Wj ln{wj) - ^ Wj In I Xj ^ yiAij | -h ^ y* - 1. 

j=l j=l \ i=l / i=l 

Since a = 1, we have that % = yi for all i, and using (IMI ). it follows that 

G(x,y) = G(x,y). 

Since we demonstrate an additive approximation for the proportional fairness via the duality gap: p{x*) — 
p{x) < G(x, y), the same additive approximation follows for the original (non-scaled) problem. 

A.2 Approximation for a-Fairness and a ^ 1 

For a / 1, we show that the algorithm achieves a multiplicative approximation for the scaled problem. In 
particular, we show that after the algorithm converges we have that: Pa{x*) — Pa{x) < raPa{x), where x* 
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is the optimal solution, x is the solution returned by the algorithm, and is a eonstant. 

Observe that sinee x = c ■ x, we have that Pa(x*) = and Pa{x) = c^““pq(x). Therefore: 

Pa{x*) -Pa{x) = C‘^~\pa{x*) - Pa{x)) 

< C“"^ • TaPaix) 

= TaPaix). 

B Primal, Dual, and the Duality Gap 


B.l Proportionally Fair Resource Allocation 

In this section we consider [w, 1)-proportional resource allocation, often referred to as the weighted propor¬ 
tionally fair resource allocation. Recall that the primal is of the form: 

n 

(Pi) max '^Wjlnixj) 
i=i 

s.t. Ax < 1, 

X > 0. 

The Lagrangian for this problem can be written as: 


Li(x; y, z) = ^ Wj ln{xj) + X] “ X] ’ 

j=i i=i \ j=i ) 

where yi, ...,ym are Lagrange multipliers, and zi,Zm are slack variables. The dual to this problem is: 

(L»i) min g{y) 
s.t. y >0, 

where giy) = mayix,z>o Lix;y, z). To maximize Li(x;?/, z), we first differentiate with respect to Xj, 
j G n}: 

,,.4y = o, 

0X4 X4 ^ 

J J 

which gives: 

m 

Xj ■ ^ ViAij = Wj, Vj G {1,..., n}. (89) 

i=l 

Plugging this back into the expression for Li(x; y, z), and noticing that, since yi, z* > 0 Vf G {1, 

Li(x; y, z) is maximized for Zi = 0, we get that: 


m m n 


-=1 z^fc=i 


AijWj 


h=l ykT^kj 


7 = 1 7 = 1 \i=l / 2=1 7 = 1 2=1 


m n m 


r=l yk^kj 


= Y - Z] + Z 

j=l j=l \i=l / i=l 

since YT=i jy^'^^mAk ■ ^ ^ 

Let Pi (x) = Y^j^iWj ln(xj) denote the primal objective. The duality gap for any pair of primal-feasible 
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Ui^ij 


+ '£yi-W. 


i=l 


X and dual-feasible (nonnegative) y is given by: 

Gi{x,y) = gi{y) - pi{x) 

n 

= -^^0 In 

j=i 

Since the primal problem maximizes a concave function over a polytope, the strong duality holds ifT^ . and 
therefore Gi{x,y) > 0 for any pair of primal-feasible x and dual-feasible y, with equality if and only if x 
and y are primal- and dual- optimal, respectively. 

B.2 a-Fair Resource Allocation for a 7 ^ 1 

Recall that for a / 1 the primal problem is: 


(Pa) max 


X 




1—a 

j — 


i=i 

s.t. Ax < 1, 

X > 0. 

The Lagrangian for this problem can be written as: 

n ^l—(y m 


1 — a 


= Pa{x) 


La{x-,y,z) = 


Wn 


3 


^ 1 — a 

j=i 1=1 


+ I i-JZa 

i=i 


where yi and Zj, for i € {1,m}, are Lagrangian multipliers and slack variables, respectively. 
The dual to {Pa) can be written as: 

{Da) min g{y) 
s.t. y > 0, 

where ga{y) = maXa;, 2 >o Lq,(x; y, z). 

Since La{x] y, z) is differentiable with respect to Xj for j G {1,..., n}, it is maximized for: 

dLa{x;y,z) _ Wj 


dxn 


As Zi ■ yi > 0'\/i ^ {1,..., m}, we get that: 


Xi 


Wj = Xj 


^ ^ Ui-Aij — 0 
i=l 
m 




2 = 1 


m n 




1/a 


Ui-c^\ET=imA. 


m 




Wi 


Wi 


- i=l 

m 


+ Yy^-Y^ 


Ija 


fc=l VkAkj 

/ m \ ~1/^ m 

^ ^ Uk^kj I ^ ^ 


j=l 


Kk=l 


2=1 




Similarly as before, for primal-feasible x and dual-feasible y, the duality gap is given as: 

Ga{x,y) = ga{y) -Pa{x) 


(90) 
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m n 


+ Y.y^- - Y 


^ x]-^ 

- > Wj ^ 


i=l j=l 


n ^1-a 
Wj^- 


Y 


Wi 


^ 1-a y\xj<^^Ziyi^ 


V 


\i=l 


m n 


i=i 


1 — a 


1 +Yy^-Y^ 

) i=\ j=i 


Ija. 

j 


a — l 

m \ ct 




Observing that: 


1/a 


y/ ^ijVi 


-Szli i-a “■ 
= Wj-Wj « -Xj -X- 


y/ ^ijVi 


\i=l 


= WjX. 


1-a _ ^ X]t=l 

Wj ) 


^ 2=1 
q: —1 


we finally get: 


n ^1—a 


Ga{x,y) = Y'^^l 
i=i 


f^rT,T=ly^A^, 


— a 


^ 1-0. (<YT=lA,y^\ 

A+Yy^-Y^^A ■[-—) 


i=i j=i 
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